Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

145

ICONIP
2009

107views Information Technology» more ICONIP 2009»

Tracking in Reinforcement Learning

15 years 4 months ago

Tracking in Reinforcement Learning

Download www.metz.supelec.fr

Reinforcement learning induces non-stationarity at several levels. Adaptation to non-stationary environments is of course a desired feature of a fair RL algorithm. Yet, even if the environment of the learning agent can be considered as stationary, generalized policy iteration frameworks, because of the interleaving of learning and control, will produce non-stationarity of the evaluated policy and so of its value function. Tracking the optimal solution instead of trying to converge to it is therefore preferable. In this paper, we propose to handle this tracking issue with a Kalman-based temporal difference framework. Complexity and convergence analysis are studied. Empirical investigations of its ability to handle non-stationarity is finally provided.

Matthieu Geist, Olivier Pietquin, Gabriel Fricout

Real-time Traffic

Fair Rl Algorithm | ICONIP 2009 | Information Technology | Kalman-based Temporal Difference | Non-stationary Environments |

claim paper

Related Content

» Tracking value function dynamics to improve reinforcement learning with piecewise linear f...

» Reinforcement Learning for PlatformIndependent Visual Robot Control

» Smoothed Sarsa Reinforcement learning for robot delivery tasks

» Adapting Reinforcement Learning for Trust Effective Modeling in Dynamic Environments

» BayesAdaptive POMDPs

» A computational theory of adaptive behavior based on an evolutionary reinforcement mechani...

» Optimization on a Budget A Reinforcement Learning Approach

» TeXDYNA Hierarchical Reinforcement Learning in Factored MDPs

» QConceptLearning Generalization with Concept Lattice Representation in Reinforcement Learn...

Post Info
More Details (n/a)

Added	19 Feb 2011
Updated	19 Feb 2011
Type	Journal
Year	2009
Where	ICONIP
Authors	Matthieu Geist, Olivier Pietquin, Gabriel Fricout

Comments (0)