Search Sciweavers | Sciweavers

509 search results - page 23 / 102

» Compositional Models for Reinforcement Learning

130

click to vote

ICML
2005
IEEE

100views Machine Learning» more ICML 2005»

Reinforcement learning with Gaussian processes

16 years 3 months ago

Download www.machinelearning.org

Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framew...

Yaakov Engel, Shie Mannor, Ron Meir

claim paper

Read More »

142

Voted

IJCAI
2001

151views Artificial Intelligence» more IJCAI 2001»

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

15 years 3 months ago

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

claim paper

Read More »

133

Voted

ICRA
2010
IEEE

137views Robotics» more ICRA 2010»

Robot reinforcement learning using EEG-based reward signals

15 years 1 months ago

Download webdiis.unizar.es

Abstract— Reinforcement learning algorithms have been successfully applied in robotics to learn how to solve tasks based on reward signals obtained during task execution. These r...

Iñaki Iturrate, Luis Montesano, Javier Ming...

claim paper

Read More »

130

Voted

WOSS
2004
ACM

128views Software Engineering» more WOSS 2004»

Self-managed decentralised systems using K-components and collaborative reinforcement learning

15 years 8 months ago

Download www.scss.tcd.ie

Components in a decentralised system are faced with uncertainty as how to best adapt to a changing environment to maintain or optimise system performance. How can individual compo...

Jim Dowling, Vinny Cahill

claim paper

Read More »

click to vote

ECML
2004
Springer

77views Machine Learning» more ECML 2004»

Filtered Reinforcement Learning

15 years 7 months ago

Download eprints.pascal-network.org

Reinforcement learning (RL) algorithms attempt to assign the credit for rewards to the actions that contributed to the reward. Thus far, credit assignment has been done in one of t...

Douglas Aberdeen

claim paper

Read More »

« Prev « First page 23 / 102 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers