Sciweavers

81 search results - page 12 / 17
» An extended policy gradient algorithm for robot task learnin...
Sort
View
AAAI
2010
13 years 8 months ago
Biped Walk Learning Through Playback and Corrective Demonstration
Developing a robust, flexible, closed-loop walking algorithm for a humanoid robot is a challenging task due to the complex dynamics of the general biped walk. Common analytical ap...
Çetin Meriçli, Manuela M. Veloso
CORR
2010
Springer
156views Education» more  CORR 2010»
13 years 7 months ago
Imitation learning of motor primitives and language bootstrapping in robots
Abstract— Imitation learning in robots, also called programing by demonstration, has made important advances in recent years, allowing humans to teach context dependant motor ski...
Thomas Cederborg, Pierre-Yves Oudeyer
GECCO
2006
Springer
208views Optimization» more  GECCO 2006»
14 years 8 days ago
Comparing evolutionary and temporal difference methods in a reinforcement learning domain
Both genetic algorithms (GAs) and temporal difference (TD) methods have proven effective at solving reinforcement learning (RL) problems. However, since few rigorous empirical com...
Matthew E. Taylor, Shimon Whiteson, Peter Stone
AI
2007
Springer
14 years 2 months ago
Competition and Coordination in Stochastic Games
Agent competition and coordination are two classical and most important tasks in multiagent systems. In recent years, there was a number of learning algorithms proposed to resolve ...
Andriy Burkov, Abdeslam Boularias, Brahim Chaib-dr...
CORR
2011
Springer
219views Education» more  CORR 2011»
13 years 3 months ago
Active Markov Information-Theoretic Path Planning for Robotic Environmental Sensing
Recent research in multi-robot exploration and mapping has focused on sampling environmental fields, which are typically modeled using the Gaussian process (GP). Existing informa...
Kian Hsiang Low, John M. Dolan, Pradeep K. Khosla