Search Sciweavers | Sciweavers

66 search results - page 9 / 14

» Bayesian Multi-Task Reinforcement Learning

180

click to vote

ICML
2005
IEEE

100views Machine Learning» more ICML 2005»

Reinforcement learning with Gaussian processes

16 years 7 months ago

Download www.machinelearning.org

Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framew...

Yaakov Engel, Shie Mannor, Ron Meir

claim paper

Read More »

150

click to vote

JMLR
2010

125views more JMLR 2010»

Variational methods for Reinforcement Learning

15 years 1 months ago

Download jmlr.csail.mit.edu

We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transit...

Thomas Furmston, David Barber

claim paper

Read More »

190

click to vote

ICML
2010
IEEE

188views Machine Learning» more ICML 2010»

Constructing States for Reinforcement Learning

15 years 4 months ago

Download www.icml2010.org

POMDPs are the models of choice for reinforcement learning (RL) tasks where the environment cannot be observed directly. In many applications we need to learn the POMDP structure ...

M. M. Hassan Mahmud

claim paper

Read More »

165

Voted

ICML
2009
IEEE

155views Machine Learning» more ICML 2009»

Near-Bayesian exploration in polynomial time

16 years 7 months ago

Download ai.stanford.edu

We consider the exploration/exploitation problem in reinforcement learning (RL). The Bayesian approach to model-based RL offers an elegant solution to this problem, by considering...

J. Zico Kolter, Andrew Y. Ng

claim paper

Read More »

180

click to vote

ICRA
2009
IEEE

132views Robotics» more ICRA 2009»

Smoothed Sarsa: Reinforcement learning for robot delivery tasks

16 years 1 months ago

Download alumni.media.mit.edu

— Our goal in this work is to make high level decisions for mobile robots. In particular, given a queue of prioritized object delivery tasks, we wish to ﬁnd a sequence of actio...

Deepak Ramachandran, Rakesh Gupta

claim paper

Read More »

« Prev « First page 9 / 14 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers