Reinforcement Learning in Continuous Action Spaces through Sequential Monte Carlo Methods

15 years 8 months ago

Download books.nips.cc

Learning in real-world domains often requires to deal with continuous state and action spaces. Although many solutions have been proposed to apply Reinforcement Learning algorithms to continuous state problems, the same techniques can be hardly extended to continuous action spaces, where, besides the computation of a good approximation of the value function, a fast method for the identiﬁcation of the highest-valued action is needed. In this paper, we propose a novel actor-critic approach in which the policy of the actor is estimated through sequential Monte Carlo methods. The importance sampling step is performed on the basis of the values learned by the critic, while the resampling step modiﬁes the actor’s policy. The proposed approach has been empirically compared to other learning algorithms into several domains; in this paper, we report results obtained in a control problem consisting of steering a boat across a river.

Alessandro Lazaric, Marcello Restelli, Andrea Bona

Real-time Traffic

Action Spaces | Continuous State | Information Technology | Learning Algorithms | NIPS 2007 |

claim paper

» Sequentially updated Probability Collectives

» Learning to fly by combining reinforcement learning with behavioural cloning

» Active Sequential Learning with Tactile Feedback

» Batch Reinforcement Learning with State Importance

» Incremental Learning of Procedural Planning Knowledge in Challenging Environments

Post Info
More Details (n/a)

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2007
Where	NIPS
Authors	Alessandro Lazaric, Marcello Restelli, Andrea Bonarini

Comments (0)

Sciweavers

Reinforcement Learning in Continuous Action Spaces through Sequential Monte Carlo Methods

Action Spaces | Continuous State | Information Technology | Learning Algorithms | NIPS 2007 |

Explore & Download

Productivity Tools

Sciweavers