Cross-Entropy Optimization of Control Policies With Adaptive Basis Functions

15 years 1 months ago

Download www.montefiore.ulg.ac.be

—This paper introduces an algorithm for direct search of control policies in continuous-state discrete-action Markov decision processes. The algorithm looks for the best closed-loop policy that can be represented using a given number of basis functions (BFs), where a discrete action is assigned to each BF. The type of the BFs and their number are speciﬁed in advance and determine the complexity of the representation. Considerable ﬂexibility is achieved by optimizing the locations and shapes of the BFs, together with the action assignments. The optimization is carried out with the cross-entropy method and evaluates the policies by their empirical return from a representative set of initial states. The return for each representative state is estimated using Monte Carlo simulations. The resulting algorithm for cross-entropy policy search with adaptive BFs is extensively evaluated in problems with two to six state variables, for which it reliably obtains good policies with only a sma...

Lucian Busoniu, Damien Ernst, Bart De Schutter, Ro

Real-time Traffic

Algorithm | Cross-entropy Policy Search | Policy Search | TSMC 2011 |

claim paper

» RealTime Population Based Optimization for Adaptive Motion Control of Robot Manipulator

» Optimal crosslayer wireless control policies using TD learning

» An Analysis of CaseBased Value Function Approximation by Approximating State Transition Gr...

» ArchitectureAware Adaptive Deployment of Contextual Security Policies

» Delay and rateoptimal control in a multiclass priority queue with adjustable service rates

» Adaptive routing with stale information

» Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximatio...

» Adaptive autonomous control using online value iteration with gaussian processes

Post Info
More Details (n/a)

Added	15 May 2011
Updated	15 May 2011
Type	Journal
Year	2011
Where	TSMC
Authors	Lucian Busoniu, Damien Ernst, Bart De Schutter, Robert Babuska

Comments (0)

Sciweavers

Cross-Entropy Optimization of Control Policies With Adaptive Basis Functions

Algorithm | Cross-entropy Policy Search | Policy Search | TSMC 2011 |

Explore & Download

Productivity Tools

Sciweavers