Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

107

ICML
2003
IEEE

favoriteEmaildiscussreport

165views Machine Learning» more ICML 2003»

The Cross Entropy Method for Fast Policy Search

16 years 2 months ago

The Cross Entropy Method for Fast Policy Search

Download www.hpl.hp.com

We present a learning framework for Markovian decision processes that is based on optimization in the policy space. Instead of using relatively slow gradient-based optimization algorithms, we use the fast Cross Entropy method. The suggested framework is described for several reward criteria and its effectiveness is demonstrated for a grid world navigation task and for an inventory control problem.

Shie Mannor, Reuven Y. Rubinstein, Yohai Gat

Real-time Traffic

Cross Entropy Method | ICML 2003 | Machine Learning | Markovian Decision Processes | Slow Gradient-based Optimization |

claim paper

Related Content

» The CrossEntropy Method for Policy Search in Decentralized POMDPs

» CrossEntropy Optimization of Control Policies With Adaptive Basis Functions

» Relative Entropy Policy Search

» Global Likelihood Optimization Via the CrossEntropy Method with an Application to Mixture ...

» The cross entropy method for classification

» CrossEntropy for MonteCarlo Tree Search

» A leaderbased parallel cross entropy algorithm for MCP

» Cross entropy and adaptive variance scaling in continuous EDA

» RealTime Population Based Optimization for Adaptive Motion Control of Robot Manipulator

Post Info
More Details (n/a)

Added	17 Nov 2009
Updated	17 Nov 2009
Type	Conference
Year	2003
Where	ICML
Authors	Shie Mannor, Reuven Y. Rubinstein, Yohai Gat

Comments (0)