Search Sciweavers | Sciweavers

87 search results - page 11 / 18

» Direct Policy Search Reinforcement Learning for Robot Contro...

ICML
2009
IEEE

Monte-Carlo simulation balancing

16 years 4 months ago

Download www.cs.ualberta.ca

In this paper we introduce the first algorithms for efficiently learning a simulation policy for Monte-Carlo search. Our main idea is to optimise the balance of a simulation polic...

David Silver, Gerald Tesauro

ROBOCUP
2009
Springer

Learning Complementary Multiagent Behaviors: A Case Study

15 years 9 months ago

Download teamcore.usc.edu

As the reach of multiagent reinforcement learning extends to more and more complex tasks, it is likely that the diverse challenges posed by some of these tasks can only be address...

Shivaram Kalyanakrishnan, Peter Stone

ECML
2005
Springer

Natural Actor-Critic

15 years 8 months ago

Download www-clmc.usc.edu

This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...

Jan Peters, Sethu Vijayakumar, Stefan Schaal

IROS
2007
IEEE

Motor control optimization of compliant one-legged locomotion in rough terrain

15 years 9 months ago

Download groups.csail.mit.edu

While underactuated robotic systems are capable of energy efficient and rapid dynamic behavior, we still do not fully understand how body dynamics can be actively used for ada...

Fumiya Iida, Russ Tedrake

ESANN
2008

Learning to play Tetris applying reinforcement learning methods

15 years 4 months ago

Download www.dice.ucl.ac.be

In this paper the application of reinforcement learning to Tetris is investigated; in particular, the idea of temporal difference learning is applied to estimate the state value funct...

Alexander Groß, Jan Friedland, Friedhelm Sch...
