policy gradient methods


NIPS
2003


Distributed Optimization in Adaptive Networks

15 years 8 months ago

We develop a protocol for optimizing dynamic behavior of a network of simple electronic components, such as a sensor network, an ad hoc network of mobile devices, or a network of ...

Ciamac Cyrus Moallemi, Benjamin Van Roy



ESANN
2008


15 years 8 months ago

Similarities and differences between policy gradient methods and evolution strategies

Download www.dice.ucl.ac.be

Natural policy gradient methods and the covariance matrix adaptation evolution strategy, two variable metric methods proposed for solving reinforcement learning tasks, are contrast...

Verena Heidrich-Meisner, Christian Igel



IROS
2006
IEEE


Policy Gradient Methods for Robotics

16 years 17 days ago

Download www.cs.utah.edu

The acquisition and improvement of motor skills and control policies for robotics from trial and error is of essential importance if robots should ever leave precisely pre-struc...

Jan Peters, Stefan Schaal


Sciweavers
