policy search | Sciweavers

43

TSMC
2011

258views more TSMC 2011»

Cross-Entropy Optimization of Control Policies With Adaptive Basis Functions

13 years 7 months ago

—This paper introduces an algorithm for direct search of control policies in continuous-state discrete-action Markov decision processes. The algorithm looks for the best closed-l...

Lucian Busoniu, Damien Ernst, Bart De Schutter, Ro...

claim paper

Read More »

38

click to vote

UAI
2000

133views Artificial Intelligence» more UAI 2000»

PEGASUS: A policy search method for large MDPs and POMDPs

14 years 1 months ago

Download ai.stanford.edu

We propose a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP), given a mo...

Andrew Y. Ng, Michael I. Jordan

claim paper

Read More »

39

click to vote

NIPS
2003

108views Information Technology» more NIPS 2003»

Policy Search by Dynamic Programming

14 years 1 months ago

Download books.nips.cc

We consider the policy search approach to reinforcement learning. We show that if a “baseline distribution” is given (indicating roughly how often we expect a good policy to v...

J. Andrew Bagnell, Sham Kakade, Andrew Y. Ng, Jeff...

claim paper

Read More »

42

click to vote

UAI
2008

234views Artificial Intelligence» more UAI 2008»

Improving Gradient Estimation by Incorporating Sensor Data

14 years 1 months ago

Download www.cs.berkeley.edu

An efficient policy search algorithm should estimate the local gradient of the objective function, with respect to the policy parameters, from as few trials as possible. Whereas m...

Gregory Lawrence, Stuart J. Russell

claim paper

Read More »

41

click to vote

AAAI
2010

191views Intelligent Agents» more AAAI 2010»

Relative Entropy Policy Search

14 years 1 months ago

Download www.kyb.tuebingen.mpg.de

Policy search is a successful approach to reinforcement learning. However, policy improvements often result in the loss of information. Hence, it has been marred by premature conv...

Jan Peters, Katharina Mülling, Yasemin Altun

claim paper

Read More »

41

click to vote

ICRA
2009
IEEE

132views Robotics» more ICRA 2009»

Smoothed Sarsa: Reinforcement learning for robot delivery tasks

14 years 7 months ago

Download alumni.media.mit.edu

— Our goal in this work is to make high level decisions for mobile robots. In particular, given a queue of prioritized object delivery tasks, we wish to ﬁnd a sequence of actio...

Deepak Ramachandran, Rakesh Gupta

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers