Search Sciweavers | Sciweavers

95 search results - page 6 / 19

» Policy Gradients for Cryptanalysis

NIPS
2008

116 views, Information Technology

Particle Filter-based Policy Gradient in POMDPs

15 years 8 months ago

Download eprints.pascal-network.org

Our setting is a Partially Observable Markov Decision Process with continuous state, observation and action spaces. Decisions are based on a Particle Filter for estimating the bel...

Pierre-Arnaud Coquelin, Romain Deguest, Rém...

ICRA
2005
IEEE

159 views, Robotics

Learning Sensory Feedback to CPG with Policy Gradient for Biped Locomotion

16 years 12 days ago

Download www.cns.atr.jp

This paper proposes a learning framework for a CPG-based biped locomotion controller using a policy gradient method. Our goal in this study is to develop an efficient learning...

Takamitsu Matsubara, Jun Morimoto, Jun Nakanishi, ...

ICANN
2010
Springer

164 views, Neural Networks

Multi-Dimensional Deep Memory Atari-Go Players for Parameter Exploring Policy Gradients

15 years 7 months ago

Download www.idsia.ch

Abstract. Developing superior artificial board-game players is a widely studied area of Artificial Intelligence. Among the most challenging games is the Asian game of Go, which, des...

Mandy Grüttner, Frank Sehnke, Tom Schaul, Jü...

IROS
2007
IEEE

123 views, Robotics

An extended policy gradient algorithm for robot task learning

16 years 1 month ago