Search Sciweavers | Sciweavers



AAAI
2010


Biped Walk Learning Through Playback and Corrective Demonstration


Download www.cs.cmu.edu

Developing a robust, flexible, closed-loop walking algorithm for a humanoid robot is a challenging task due to the complex dynamics of the general biped walk. Common analytical ap...

Çetin Meriçli, Manuela M. Veloso


CORR
2010
Springer


Imitation learning of motor primitives and language bootstrapping in robots


Download flowers.inria.fr

Imitation learning in robots, also called programming by demonstration, has made important advances in recent years, allowing humans to teach context-dependent motor ski...

Thomas Cederborg, Pierre-Yves Oudeyer


GECCO
2006
Springer


Comparing evolutionary and temporal difference methods in a reinforcement learning domain


Download www.cs.bham.ac.uk

Both genetic algorithms (GAs) and temporal difference (TD) methods have proven effective at solving reinforcement learning (RL) problems. However, since few rigorous empirical com...

Matthew E. Taylor, Shimon Whiteson, Peter Stone


AI
2007
Springer


Competition and Coordination in Stochastic Games


Download www.damas.ift.ulaval.ca

Agent competition and coordination are two classical and important tasks in multiagent systems. In recent years, a number of learning algorithms have been proposed to resolve ...

Andriy Burkov, Abdeslam Boularias, Brahim Chaib-dr...


CORR
2011
Springer


Active Markov Information-Theoretic Path Planning for Robotic Environmental Sensing


Download www.comp.nus.edu.sg

Recent research in multi-robot exploration and mapping has focused on sampling environmental fields, which are typically modeled using the Gaussian process (GP). Existing informa...

Kian Hsiang Low, John M. Dolan, Pradeep K. Khosla
