Sciweavers

286 search results - page 8 / 58
» Using inaccurate models in reinforcement learning
Sort
View
IROS
2007
IEEE
136views Robotics» more  IROS 2007»
14 years 2 months ago
Affordance-based imitation learning in robots
— In this paper we build an imitation learning algorithm for a humanoid robot on top of a general world model provided by learned object affordances. We consider that the robot h...
Manuel Lopes, Francisco S. Melo, Luis Montesano
NIPS
2000
13 years 9 months ago
Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task
The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...
Brian Sallans, Geoffrey E. Hinton
PKDD
2010
Springer
129views Data Mining» more  PKDD 2010»
13 years 6 months ago
Smarter Sampling in Model-Based Bayesian Reinforcement Learning
Abstract. Bayesian reinforcement learning (RL) is aimed at making more efficient use of data samples, but typically uses significantly more computation. For discrete Markov Decis...
Pablo Samuel Castro, Doina Precup
ICANN
2010
Springer
13 years 8 months ago
Using Reinforcement Learning to Guide the Development of Self-organised Feature Maps for Visual Orienting
We present a biologically inspired neural network model of visual orienting (using saccadic eye movements) in which targets are preferentially selected according to their reward va...
Kevin Brohan, Kevin N. Gurney, Piotr Dudek
IROS
2009
IEEE
206views Robotics» more  IROS 2009»
14 years 2 months ago
Bayesian reinforcement learning in continuous POMDPs with gaussian processes
— Partially Observable Markov Decision Processes (POMDPs) provide a rich mathematical model to handle realworld sequential decision processes but require a known model to be solv...
Patrick Dallaire, Camille Besse, Stéphane R...