Search Sciweavers | Sciweavers

163 search results - page 4 / 33

» Policy Gradient Methods for Robotics

ICRA 2010, IEEE

A simple learning strategy for high-speed quadrocopter multi-flips

15 years 5 months ago

Download www.idsc.ethz.ch

We describe a simple and intuitive policy gradient method for improving parametrized quadrocopter multi-flips by combining iterative experiments with information from a first...

Sergei Lupashin, Angela Schöllig, Michael She...

ICML 2009, IEEE

Predictive representations for policy gradient in POMDPs

16 years 7 months ago

Download damas.ift.ulaval.ca

We consider the problem of estimating the policy gradient in Partially Observable Markov Decision Processes (POMDPs) with a special class of policies that are based on Predictive ...

Abdeslam Boularias, Brahim Chaib-draa

ICANN 2010, Springer

Policy Gradients for Cryptanalysis

15 years 7 months ago

Download www6.in.tum.de

So-called Physical Unclonable Functions are an emerging, new cryptographic and security primitive. They can potentially replace secret binary keys in vulnerable hardware systems an...

Frank Sehnke, Christian Osendorfer, Jan Sölte...

AAAI 2010

Relative Entropy Policy Search

15 years 8 months ago

Download www.kyb.tuebingen.mpg.de

Policy search is a successful approach to reinforcement learning. However, policy improvements often result in the loss of information. Hence, it has been marred by premature conv...

Jan Peters, Katharina Mülling, Yasemin Altun

IJCAI 2003

Covariant Policy Search

15 years 8 months ago

Download www.ri.cmu.edu

We investigate the problem of non-covariant behavior of policy gradient reinforcement learning algorithms. The policy gradient approach is amenable to analysis by information geom...

J. Andrew Bagnell, Jeff G. Schneider
