Search Sciweavers | Sciweavers

536 search results - page 18 / 108

PKDD
2009
Springer

181 views, Data Mining

Active Learning for Reward Estimation in Inverse Reinforcement Learning

16 years 1 months ago

Download users.isr.ist.utl.pt

Abstract. Inverse reinforcement learning addresses the general problem of recovering a reward function from samples of a policy provided by an expert/demonstrator. In this paper, w...

Manuel Lopes, Francisco S. Melo, Luis Montesano

ATAL
2010
Springer

181 views, Intelligent Agents

Basis function construction for hierarchical reinforcement learning

15 years 7 months ago

Download www.cs.brown.edu

This paper introduces an approach to automatic basis function construction for Hierarchical Reinforcement Learning (HRL) tasks. We describe some considerations that arise when con...

Sarah Osentoski, Sridhar Mahadevan

ECAI
2010
Springer

211 views, Artificial Intelligence

Case-Based Multiagent Reinforcement Learning: Cases as Heuristics for Selection of Actions

15 years 7 months ago

Download www.iiia.csic.es

This work presents a new approach that allows the use of cases in a case base as heuristics to speed up Multiagent Reinforcement Learning algorithms, combining Case-Based Reasoning...

Reinaldo A. C. Bianchi, Ramon López de M&aa...

ICML
1998
IEEE

179 views, Machine Learning

Value Function Based Production Scheduling

16 years 7 months ago

Download www.ri.cmu.edu

Production scheduling, the problem of sequentially configuring a factory to meet forecasted demands, is a critical problem throughout the manufacturing industry. The requirement of...

Jeff G. Schneider, Justin A. Boyan, Andrew W. Moor...

COLT
2000
Springer

87 views, Machine Learning

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 11 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process (POMDP), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter
