Search Sciweavers | Sciweavers

133 search results - page 4 / 27

» Hierarchical Policy Gradient Algorithms

176

click to vote

ICMLA
2010

203views Machine Learning» more ICMLA 2010»

Multimodal Parameter-exploring Policy Gradients

15 years 4 months ago

Download www6.in.tum.de

Abstract-- Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...

Frank Sehnke, Alex Graves, Christian Osendorfer, J...

claim paper

Read More »

146

click to vote

ICRA
2005
IEEE

159views Robotics» more ICRA 2005»

Learning Sensory Feedback to CPG with Policy Gradient for Biped Locomotion

15 years 11 months ago

Download www.cns.atr.jp

— This paper proposes a learning framework for a CPG-based biped locomotion controller using a policy gradient method. Our goal in this study is to develop an efﬁcient learning...

Takamitsu Matsubara, Jun Morimoto, Jun Nakanishi, ...

claim paper

Read More »

159

click to vote

AAAI
2010

171views Intelligent Agents» more AAAI 2010»

Multi-Agent Learning with Policy Prediction

15 years 7 months ago

Download www.cs.umass.edu

Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. This paper first introduces a new gradient-based learning algorithm, augmenting th...

Chongjie Zhang, Victor R. Lesser

claim paper

Read More »

174

click to vote

IROS
2006
IEEE

113views Robotics» more IROS 2006»

Policy Gradient Methods for Robotics

16 years 1 days ago

Download www.cs.utah.edu

— The aquisition and improvement of motor skills and control policies for robotics from trial and error is of essential importance if robots should ever leave precisely pre-struc...

Jan Peters, Stefan Schaal

claim paper

Read More »

154

click to vote

ICANN
2010
Springer

164views Neural Networks» more ICANN 2010»

Multi-Dimensional Deep Memory Atari-Go Players for Parameter Exploring Policy Gradients

15 years 6 months ago

Download www.idsia.ch

Abstract. Developing superior artificial board-game players is a widelystudied area of Artificial Intelligence. Among the most challenging games is the Asian game of Go, which, des...

Mandy Grüttner, Frank Sehnke, Tom Schaul, J&u...

claim paper

Read More »

« Prev « First page 4 / 27 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers