Search Sciweavers | Sciweavers

115 search results - page 7 / 23

» Recurrent policy gradients

194

Voted

NIPS
2008

116views Information Technology» more NIPS 2008»

Particle Filter-based Policy Gradient in POMDPs

15 years 9 months ago

Download eprints.pascal-network.org

Our setting is a Partially Observable Markov Decision Process with continuous state, observation and action spaces. Decisions are based on a Particle Filter for estimating the bel...

Pierre-Arnaud Coquelin, Romain Deguest, Rém...

claim paper

Read More »

190

click to vote

ICRA
2005
IEEE

159views Robotics» more ICRA 2005»

Learning Sensory Feedback to CPG with Policy Gradient for Biped Locomotion

16 years 1 months ago

Download www.cns.atr.jp

— This paper proposes a learning framework for a CPG-based biped locomotion controller using a policy gradient method. Our goal in this study is to develop an efﬁcient learning...

Takamitsu Matsubara, Jun Morimoto, Jun Nakanishi, ...

claim paper

Read More »

144

click to vote

IROS
2007
IEEE

123views Robotics» more IROS 2007»

An extended policy gradient algorithm for robot task learning

16 years 1 months ago