IJCAI
2003

Covariant Policy Search

Download www.ri.cmu.edu

We investigate the problem of non-covariant behavior of policy gradient reinforcement learning algorithms. The policy gradient approach is amenable to analysis by information geometric methods. This leads us to propose a natural metric on controller parameterization that results from considering the manifold of probability distributions over paths induced by a stochastic controller. Investigation of this approach leads to a covariant gradient ascent rule. Interesting properties of this rule are discussed, including its relation with actor-critic style reinforcement learning algorithms. The algorithms discussed here are computationally quite efficient and on some interesting problems lead to dramatic performance improvement over non-covariant rules.
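
To make the covariance issue concrete, here is a minimal toy sketch. It is not the paper's algorithm: it assumes a one-step, two-action problem in which the path distribution reduces to the action distribution. The policy picks action 1 with probability sigmoid(theta), and the same policy is re-expressed through a hypothetical rescaled parameter phi with theta = scale * phi. The function names, rewards, and learning rate are illustrative assumptions. A plain gradient step lands on a different policy depending on the scale, while dividing the gradient by the Fisher information gives the same policy under either parameterization.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def ascent_step(phi, scale, r1, r0, lr, covariant):
    """One ascent step on phi, where the policy logit is theta = scale * phi.

    Expected reward is J = p * r1 + (1 - p) * r0 with p = sigmoid(theta).
    """
    p = sigmoid(scale * phi)
    grad = scale * p * (1.0 - p) * (r1 - r0)  # dJ/dphi by the chain rule
    if covariant:
        # Fisher information of the Bernoulli action distribution w.r.t. phi.
        fisher = scale ** 2 * p * (1.0 - p)
        grad = grad / fisher
    return phi + lr * grad


def prob_after_step(scale, covariant, theta0=0.5, r1=1.0, r0=0.0, lr=0.1):
    """Probability of action 1 after one step, starting from the same policy."""
    phi0 = theta0 / scale  # same initial policy regardless of scale
    phi1 = ascent_step(phi0, scale, r1, r0, lr, covariant)
    return sigmoid(scale * phi1)


if __name__ == "__main__":
    for scale in (1.0, 10.0):
        print(scale,
              prob_after_step(scale, covariant=False),
              prob_after_step(scale, covariant=True))
```

In this toy setting the Fisher-scaled step depends only on the policy itself, not on how it is parameterized, which is the property the abstract calls covariance.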

J. Andrew Bagnell, Jeff G. Schneider

Gradient | IJCAI 2003 | IJCAI 2007 | Policy Gradient Reinforcement | Reinforcement Learning Algorithms |

Related Content

» Relative Entropy Policy Search

» Uncertainty handling CMA-ES for reinforcement learning

» Similarities and differences between policy gradient methods and evolution strategies

» Acceleration of Covariance Models for Noncoding RNA Search

» Steady-State Selection and Efficient Covariance Matrix Update in the Multi-objective CMA-ES

» Bayesian actor-critic algorithms

» Region Covariance: A Fast Descriptor for Detection and Classification

» Surrogate Constraint Functions for CMA Evolution Strategies

» Training of Support Vector Machines with Mahalanobis Kernels

Post Info

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2003
Where	IJCAI
Authors	J. Andrew Bagnell, Jeff G. Schneider