Search Sciweavers | Sciweavers

81 search results - page 4 / 17

» The Optimal Reward Baseline for Gradient-Based Reinforcement...

click to vote

ICML
2007
IEEE

141views Machine Learning» more ICML 2007»

Reinforcement learning by reward-weighted regression for operational space control

14 years 8 months ago

Download www.machinelearning.org

Many robot control problems of practical importance, including operational space control, can be reformulated as immediate reward reinforcement learning problems. However, few of ...

Jan Peters, Stefan Schaal

claim paper

Read More »

click to vote

KI
2007
Springer

124views Artificial Intelligence» more KI 2007»

Making a Robot Learn to Play Soccer Using Reward and Punishment

14 years 1 months ago

Download www.ni.uos.de

In this paper, we show how reinforcement learning can be applied to real robots to achieve optimal robot behavior. As example, we enable an autonomous soccer robot to learn interce...

Heiko Müller, Martin Lauer, Roland Hafner, Sa...

claim paper

Read More »

click to vote

ICML
2008
IEEE

105views Machine Learning» more ICML 2008»

Learning all optimal policies with multiple criteria

14 years 8 months ago

Download leon.barrettnexus.com

We describe an algorithm for learning in the presence of multiple criteria. Our technique generalizes previous approaches in that it can learn optimal policies for all linear pref...

Leon Barrett, Srini Narayanan

claim paper

Read More »

click to vote

EUROCAST
2007
Springer

182views Hardware» more EUROCAST 2007»

A k-NN Based Perception Scheme for Reinforcement Learning

14 years 1 months ago

Download www.dia.fi.upm.es

Abstract a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...

José Antonio Martin H., Javier de Lope Asia...

claim paper

Read More »

click to vote

IJCAI
2001

151views Artificial Intelligence» more IJCAI 2001»

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

13 years 9 months ago

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

claim paper

Read More »

« Prev « First page 4 / 17 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers