ICANNGA 2007, Springer

Reinforcement Learning in Fine Time Discretization

Download staff.elka.pw.edu.pl

Reinforcement Learning (RL) is analyzed here as a tool for control system optimization. State and action spaces are assumed to be continuous. Time is assumed to be discrete, yet the discretization may be arbitrarily fine. It is shown that stationary policies, which most RL methods apply, are unsuitable for control applications, since under fine time discretization they cannot assure bounded variance of policy gradient estimators. As a remedy, we propose the use of piecewise non-Markov policies. Policies of this type can be optimized by most RL algorithms, namely those based on the likelihood ratio.
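
The variance claim in the abstract can be illustrated with a small numerical sketch. The code below is not taken from the paper: the one-dimensional dynamics, the quadratic reward, and the function names `lr_gradient_samples` and `estimator_variance` are all assumptions chosen for illustration. It draws likelihood-ratio gradient samples for a stationary Gaussian policy and measures how their empirical variance changes with the time step. It does not implement the piecewise non-Markov policies the paper proposes.

```python
import numpy as np


def lr_gradient_samples(dt, horizon=1.0, sigma=1.0, theta=0.0,
                        episodes=2000, seed=0):
    """Per-episode likelihood-ratio gradient samples (toy setup, assumed).

    Policy: a_t ~ N(theta, sigma^2), drawn independently at every step
    (a stationary policy). Dynamics and reward are hypothetical.
    """
    rng = np.random.default_rng(seed)
    steps = int(round(horizon / dt))
    actions = theta + sigma * rng.standard_normal((episodes, steps))
    # Assumed dynamics: x_{t+1} = x_t + a_t * dt, with x_0 = 0.
    states = np.cumsum(actions * dt, axis=1)
    # Assumed reward rate -(x - 1)^2, summed over the episode.
    returns = -np.sum((states - 1.0) ** 2 * dt, axis=1)
    # Score function: sum over steps of d/dtheta log N(a_t; theta, sigma^2).
    score = np.sum((actions - theta) / sigma ** 2, axis=1)
    return returns * score


def estimator_variance(dt, **kwargs):
    """Empirical variance of the single-episode gradient estimator."""
    return float(np.var(lr_gradient_samples(dt, **kwargs)))


if __name__ == "__main__":
    for dt in (0.1, 0.01, 0.001):
        print(f"dt={dt}: variance={estimator_variance(dt):.1f}")
```

In this toy setup the horizon is fixed, so a smaller `dt` means more independent action draws per episode. The score term then accumulates more noise while the return barely changes, and the printed variance should grow as `dt` shrinks.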

Pawel Wawrzynski

ICANNGA 2007 | Policy Gradient Estimators | RL Methods | Fine Time Discretization

Post Info

Added	08 Jun 2010
Updated	08 Jun 2010
Type	Conference
Year	2007
Where	ICANNGA
Authors	Pawel Wawrzynski
