Search Sciweavers | Sciweavers

121 search results - page 1 / 25

» Investigating practical, linear temporal difference learning

189

click to vote

ICML
1999
IEEE

168views Machine Learning» more ICML 1999»

Least-Squares Temporal Difference Learning

16 years 7 months ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan

claim paper

Read More »

170

click to vote

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

16 years 7 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

186

click to vote

HICSS
2006
IEEE

141views Biometrics» more HICSS 2006»

Temporal Implications of Information Technology for Work Practices: Organizing in and for Time in an Emergency Department

16 years 7 days ago

Download sprouts.aisnet.org

We investigate the temporal implications of information technology by examining its use in the work practices of physicians and nurses in an emergency department. We conceptualize...

Zixing Shen, Youngjin Yoo, Kalle Lyytinen

claim paper

Read More »

198

click to vote

CORR
2010
Springer

204views Education» more CORR 2010»

Predictive State Temporal Difference Learning

15 years 4 months ago

Download www.cs.cmu.edu

We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identiﬁcation. In practical applications...

Byron Boots, Geoffrey J. Gordon

claim paper

Read More »

154

click to vote

ICML
2008
IEEE

165views Machine Learning» more ICML 2008»

A worst-case comparison between temporal difference and residual gradient with linear function approximation

16 years 7 months ago

Download www.research.rutgers.edu

Residual gradient (RG) was proposed as an alternative to TD(0) for policy evaluation when function approximation is used, but there exists little formal analysis comparing them ex...

Lihong Li

claim paper

Read More »

« Prev « First page 1 / 25 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers