Search Sciweavers | Sciweavers

3412 search results - page 2 / 683

» Efficient Reinforcement Learning

152

click to vote

COLT
2008
Springer

132views Machine Learning» more COLT 2008»

Adaptive Aggregation for Reinforcement Learning with Efficient Exploration: Deterministic Domains

15 years 8 months ago

Download colt2008.cs.helsinki.fi

We propose a model-based learning algorithm, the Adaptive Aggregation Algorithm (AAA), that aims to solve the online, continuous state space reinforcement learning problem in a de...

Andrey Bernstein, Nahum Shimkin

claim paper

Read More »

196

click to vote

ICML
2007
IEEE

141views Machine Learning» more ICML 2007»

Reinforcement learning by reward-weighted regression for operational space control

16 years 7 months ago

Download www.machinelearning.org

Many robot control problems of practical importance, including operational space control, can be reformulated as immediate reward reinforcement learning problems. However, few of ...

Jan Peters, Stefan Schaal

claim paper

Read More »

169

click to vote

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

PAC model-free reinforcement learning

16 years 7 months ago

Download cseweb.ucsd.edu

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...

claim paper

Read More »

179

click to vote

ICANN
2009
Springer

123views Neural Networks» more ICANN 2009»

Efficient Uncertainty Propagation for Reinforcement Learning with Limited Data

15 years 10 months ago

Download www.tu-ilmenau.de

In a typical reinforcement learning (RL) setting details of the environment are not given explicitly but have to be estimated from observations. Most RL approaches only optimize th...

Alexander Hans, Steffen Udluft

claim paper

Read More »

164

click to vote

COLT
1994
Springer

91views Machine Learning» more COLT 1994»

Efficient Reinforcement Learning

15 years 10 months ago

Download www.research.rutgers.edu

Realistic domains for learning possess regularities that make it possible to generalize experience across related states. This paper explores an environment-modeling framework tha...

Claude-Nicolas Fiechter

claim paper

Read More »

« Prev « First page 2 / 683 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers