Towards Finite-Sample Convergence of Direct Reinforcement Learning

15 years 1 months ago

Download www.cs.uiuc.edu

Abstract. While direct, model-free reinforcement learning often performs better than model-based approaches in practice, only the latter have yet supported theoretical guarantees for ﬁnite-sample convergence. A major diﬃculty in analyzing the direct approach in an online setting is the absence of a deﬁnitive exploration strategy. We extend the notion of admissibility to direct reinforcement learning and show that standard Q-learning with optimistic initial values and constant learning rate is admissible. The notion justiﬁes the use of a greedy strategy that we believe performs very well in practice and holds theoretical signiﬁcance in deriving ﬁnite-sample convergence for direct reinforcement learning. We present empirical evidence that supports our idea.

Shiau Hong Lim, Gerald DeJong

Real-time Traffic

Direct Reinforcement | ECML 2005 | Reinforcement Learning | ﬁnite-sample Convergence |

claim paper

Post Info
More Details (n/a)

Added	27 Jun 2010
Updated	27 Jun 2010
Type	Conference
Year	2005
Where	ECML
Authors	Shiau Hong Lim, Gerald DeJong

Comments (0)

Sciweavers

Towards Finite-Sample Convergence of Direct Reinforcement Learning

Direct Reinforcement | ECML 2005 | Reinforcement Learning | ﬁnite-sample Convergence |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers