Search Sciweavers | Sciweavers

2108 search results - page 111 / 422

» Tracking in Reinforcement Learning

click to vote

ICML
2008
IEEE

133views Machine Learning» more ICML 2008»

An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning

14 years 10 months ago

Download www.cs.duke.edu

We show that linear value-function approximation is equivalent to a form of linear model approximation. We then derive a relationship between the model-approximation error and the...

Ronald Parr, Lihong Li, Gavin Taylor, Christopher ...

claim paper

Read More »

click to vote

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Sensitive Discount Optimality: Unifying Discounted and Average Reward Reinforcement Learning

14 years 10 months ago

Download reference.kfupm.edu.sa

Research in reinforcementlearning (RL)has thus far concentrated on two optimality criteria: the discounted framework, which has been very well-studied, and the averagereward frame...

Sridhar Mahadevan

claim paper

Read More »

click to vote

ICES
2003
Springer

125views Hardware» more ICES 2003»

Evolving Reinforcement Learning-Like Abilities for Robots

14 years 3 months ago

Download lis.epfl.ch

Abstract. In [8] Yamauchi and Beer explored the abilities of continuous time recurrent neural networks (CTRNNs) to display reinforcementlearning like abilities. The investigated ta...

Jesper Blynel

claim paper

Read More »

click to vote

AAAI
2008

105views Intelligent Agents» more AAAI 2008»

Potential-based Shaping in Model-based Reinforcement Learning

14 years 10 days ago

Download www.aaai.org

Potential-based shaping was designed as a way of introducing background knowledge into model-free reinforcement-learning algorithms. By identifying states that are likely to have ...

John Asmuth, Michael L. Littman, Robert Zinkov

claim paper

Read More »

click to vote

NIPS
2007

149views Information Technology» more NIPS 2007»

Online Linear Regression and Its Application to Model-Based Reinforcement Learning

13 years 11 months ago

Download books.nips.cc

We provide a provably efﬁcient algorithm for learning Markov Decision Processes (MDPs) with continuous state and action spaces in the online setting. Speciﬁcally, we take a mo...

Alexander L. Strehl, Michael L. Littman

claim paper

Read More »

« Prev « First page 111 / 422 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers