Search Sciweavers | Sciweavers

286 search results - page 8 / 58

» Using inaccurate models in reinforcement learning

IROS 2007, IEEE (Robotics)

Affordance-based imitation learning in robots

14 years 2 months ago

Download users.isr.ist.utl.pt

In this paper we build an imitation learning algorithm for a humanoid robot on top of a general world model provided by learned object affordances. We consider that the robot h...

Manuel Lopes, Francisco S. Melo, Luis Montesano

NIPS 2000 (Information Technology)

Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task

13 years 9 months ago

Download members.chello.at

The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...

Brian Sallans, Geoffrey E. Hinton

PKDD 2010, Springer (Data Mining)

Smarter Sampling in Model-Based Bayesian Reinforcement Learning

13 years 6 months ago

Download www.cs.mcgill.ca

Bayesian reinforcement learning (RL) is aimed at making more efficient use of data samples, but typically uses significantly more computation. For discrete Markov Decis...

Pablo Samuel Castro, Doina Precup

ICANN 2010, Springer (Neural Networks)

Using Reinforcement Learning to Guide the Development of Self-organised Feature Maps for Visual Orienting

13 years 8 months ago

Download personalpages.manchester.ac.uk

We present a biologically inspired neural network model of visual orienting (using saccadic eye movements) in which targets are preferentially selected according to their reward va...

Kevin Brohan, Kevin N. Gurney, Piotr Dudek

IROS 2009, IEEE (Robotics)

Bayesian reinforcement learning in continuous POMDPs with Gaussian processes

14 years 2 months ago

Download www.cs.cmu.edu

Partially Observable Markov Decision Processes (POMDPs) provide a rich mathematical model to handle real-world sequential decision processes but require a known model to be solv...

Patrick Dallaire, Camille Besse, Stéphane R...
