
» Reinforcement Learning with Time

CORR
2010
Springer

Predictive State Temporal Difference Learning

We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identification. In practical applications...

Byron Boots, Geoffrey J. Gordon

DIGITEL
2008
IEEE

Adaptive Educational Games: Providing Non-invasive Personalised Learning Experiences

Educational games have the potential to provide intrinsically motivating learning experiences that immerse and engage the learner. However, the much heralded benefits of education...

Neil Peirce, Owen Conlan, Vincent Wade

WWW
2008
ACM

Web video topic discovery and tracking via bipartite graph reinforcement model

Automatic topic discovery and tracking on web-shared videos can greatly benefit both web service providers and end users. Most of current solutions of topic detection and tracking...

Lu Liu, Lifeng Sun, Yong Rui, Yao Shi, Shiqiang Ya...

NIPS
1996

Why did TD-Gammon Work?

Although TD-Gammon is one of the major successes in machine learning, it has not led to similar impressive breakthroughs in temporal difference learning for other applications or ...

Jordan B. Pollack, Alan D. Blair

ICML
2010
IEEE

Finite-Sample Analysis of LSTD

In this paper we consider the problem of policy evaluation in reinforcement learning, i.e., learning the value function of a fixed policy, using the least-squares temporal-differe...

Alessandro Lazaric, Mohammad Ghavamzadeh, Ré...
