

Hierarchical Policy Gradient Algorithms

Hierarchical reinforcement learning is a general framework that attempts to accelerate policy learning in large domains. Policy gradient reinforcement learning (PGRL) methods, meanwhile, have received recent attention as a means of solving problems with continuous state spaces, but they suffer from slow convergence. In this paper, we combine these two approaches and propose a family of hierarchical policy gradient algorithms for problems with continuous state and/or action spaces. We also introduce a class of hierarchical hybrid algorithms, in which some subtasks, usually at the higher levels of the hierarchy, are formulated as value function-based RL (VFRL) problems and the others as PGRL problems. We demonstrate the performance of the proposed algorithms on a simple taxi-fuel problem and a complex ship-steering domain with continuous states and actions.
Mohammad Ghavamzadeh, Sridhar Mahadevan
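
The hybrid structure described in the abstract lends itself to a short illustration. Below is a minimal, hypothetical sketch (not the paper's algorithm): a high-level subtask learned with value function-based RL (tabular Q-learning) chooses among low-level subtasks, each of which maintains a Gaussian policy over a continuous action and is updated with a REINFORCE-style policy gradient. The class names, feature interface, and hyperparameters are illustrative assumptions.

```python
import numpy as np

class PGSubtask:
    """Low-level subtask: 1-D continuous action drawn from a Gaussian policy
    whose mean is linear in the state features (REINFORCE-style updates)."""
    def __init__(self, n_features, lr=0.01, sigma=0.5):
        self.w = np.zeros(n_features)      # policy mean is w . phi(s)
        self.lr, self.sigma = lr, sigma

    def act(self, phi):
        mu = float(self.w @ phi)
        a = np.random.normal(mu, self.sigma)
        # score function: grad_w log pi(a|s) for a Gaussian policy
        grad_log = (a - mu) / (self.sigma ** 2) * phi
        return a, grad_log

    def update(self, grad_logs, returns):
        # REINFORCE: step along sum_t G_t * grad_w log pi(a_t|s_t)
        for g, G in zip(grad_logs, returns):
            self.w += self.lr * G * g


class HybridAgent:
    """High-level subtask learned with value function-based RL (tabular
    Q-learning over which low-level subtask to invoke), sitting on top of
    policy-gradient low-level subtasks."""
    def __init__(self, n_high_states, subtasks, lr=0.1, gamma=0.95, eps=0.1):
        self.Q = np.zeros((n_high_states, len(subtasks)))
        self.subtasks = subtasks
        self.lr, self.gamma, self.eps = lr, gamma, eps

    def choose_subtask(self, s):
        # epsilon-greedy choice of which low-level subtask to run next
        if np.random.rand() < self.eps:
            return np.random.randint(len(self.subtasks))
        return int(np.argmax(self.Q[s]))

    def update_high(self, s, o, reward, s_next, steps):
        # SMDP-style Q-learning backup: the chosen subtask ran for `steps` steps
        target = reward + (self.gamma ** steps) * self.Q[s_next].max()
        self.Q[s, o] += self.lr * (target - self.Q[s, o])
```

The sketch only mirrors the division of labour the abstract describes between value-based and policy-gradient subtasks; the paper develops the task decomposition, the gradient estimators, and the high-level formulation formally.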
Added: 17 Nov 2009
Updated: 17 Nov 2009
Type: Conference
Year: 2003
Where: ICML
Authors: Mohammad Ghavamzadeh, Sridhar Mahadevan