Search Sciweavers | Sciweavers

128 search results - page 11 / 26

» Hierarchically Optimal Average Reward Reinforcement Learning

JMLR
2010

A Convergent Online Single Time Scale Actor Critic Algorithm

15 years 1 month ago

Download jmlr.csail.mit.edu

Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...

Dotan Di Castro, Ron Meir

NIPS
1997

113 views, Information Technology

Nonparametric Model-Based Reinforcement Learning

15 years 8 months ago

Download www.cs.cmu.edu

This paper describes some of the interactions of model learning algorithms and planning algorithms we have found in exploring model-based reinforcement learning. The paper focuses...

Christopher G. Atkeson

IWANN
1999
Springer

115 views, Neural Networks

Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning

15 years 11 months ago

Download www.cs.colostate.edu

To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...

R. Matthew Kretchmar, Charles W. Anderson

AAAI
2006

190 views, Intelligent Agents

Action Selection in Bayesian Reinforcement Learning

15 years 8 months ago

Download www.aaai.org

My research attempts to address on-line action selection in reinforcement learning from a Bayesian perspective. The idea is to develop more effective action selection techniques b...

Tao Wang

ICML
2003
IEEE

105 views, Machine Learning

Principled Methods for Advising Reinforcement Learning Agents

16 years 7 months ago

Download www.hpl.hp.com

An important issue in reinforcement learning is how to incorporate expert knowledge in a principled manner, especially as we scale up to real-world tasks. In this paper, we presen...

Eric Wiewiora, Garrison W. Cottrell, Charles Elkan
