Search Sciweavers | Sciweavers

132 search results - page 9 / 27

» Generalization in Reinforcement Learning: Safely Approximati...

213

Voted

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

16 years 8 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

205

click to vote

ICML
1998
IEEE

179views Machine Learning» more ICML 1998»

Value Function Based Production Scheduling

16 years 8 months ago

Download www.ri.cmu.edu

Production scheduling, the problem of sequentially con guring a factory to meet forecasted demands, is a critical problem throughout the manufacturing industry. The requirement of...

Jeff G. Schneider, Justin A. Boyan, Andrew W. Moor...

claim paper

Read More »

173

Voted

ATAL
2010
Springer

181views Intelligent Agents» more ATAL 2010»

Basis function construction for hierarchical reinforcement learning

15 years 8 months ago

Download www.cs.brown.edu

This paper introduces an approach to automatic basis function construction for Hierarchical Reinforcement Learning (HRL) tasks. We describe some considerations that arise when con...

Sarah Osentoski, Sridhar Mahadevan

claim paper

Read More »

223

Voted

AAAI
2006

190views Intelligent Agents» more AAAI 2006»

Action Selection in Bayesian Reinforcement Learning

15 years 8 months ago

Download www.aaai.org

My research attempts to address on-line action selection in reinforcement learning from a Bayesian perspective. The idea is to develop more effective action selection techniques b...

Tao Wang

claim paper

Read More »

269

click to vote

EWRL
2008

191views Machine Learning» more EWRL 2008»

Bayesian Reward Filtering

15 years 9 months ago

Download www.metz.supelec.fr

A wide variety of function approximation schemes have been applied to reinforcement learning. However, Bayesian filtering approaches, which have been shown efficient in other field...

Matthieu Geist, Olivier Pietquin, Gabriel Fricout

claim paper

Read More »

« Prev « First page 9 / 27 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers