Sciweavers

4544 search results - page 179 / 909
» Reinforcement Learning with Time
CORR
2010
Springer
13 years 8 months ago
Predictive State Temporal Difference Learning
We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identification. In practical applications...
Byron Boots, Geoffrey J. Gordon
DIGITEL
2008
IEEE
14 years 18 hours ago
Adaptive Educational Games: Providing Non-invasive Personalised Learning Experiences
Educational games have the potential to provide intrinsically motivating learning experiences that immerse and engage the learner. However, the much-heralded benefits of education...
Neil Peirce, Owen Conlan, Vincent Wade
WWW
2008
ACM
14 years 11 months ago
Web video topic discovery and tracking via bipartite graph reinforcement model
Automatic topic discovery and tracking on web-shared videos can greatly benefit both web service providers and end users. Most current solutions for topic detection and tracking...
Lu Liu, Lifeng Sun, Yong Rui, Yao Shi, Shiqiang Ya...
NIPS
1996
13 years 11 months ago
Why did TD-Gammon Work?
Although TD-Gammon is one of the major successes in machine learning, it has not led to similar impressive breakthroughs in temporal difference learning for other applications or ...
Jordan B. Pollack, Alan D. Blair
ICML
2010
IEEE
13 years 11 months ago
Finite-Sample Analysis of LSTD
In this paper we consider the problem of policy evaluation in reinforcement learning, i.e., learning the value function of a fixed policy, using the least-squares temporal-differe...
Alessandro Lazaric, Mohammad Ghavamzadeh, Ré...