Search Sciweavers | Sciweavers

486 search results - page 14 / 98

» A Bayesian Framework for Reinforcement Learning

HPDC
2009
IEEE

108 views - Distributed And Parallel Com...

Maestro: a self-organizing peer-to-peer dataflow framework using reinforcement learning

15 years 10 months ago

Download www.cs.vu.nl

In this paper we describe Maestro, a dataflow computation framework for Ibis, our Java-based grid middleware. The novelty of Maestro is that it is a self-organizing peer-to-peer s...

C. van Reeuwijk

AI
1999
Springer

110 views - Artificial Intelligence

Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning

15 years 6 months ago

Download webdocs.cs.ualberta.ca

Richard S. Sutton, Doina Precup, Satinder P. Singh

ATAL
2009
Springer

125 views - Intelligent Agents

Abstraction and Generalization in Reinforcement Learning: A Summary and Framework

15 years 4 months ago

Download www.personeel.unimaas.nl

Marc J. V. Ponsen, Matthew E. Taylor, Karl Tuyls

ICML
1996
IEEE

162 views - Machine Learning

Sensitive Discount Optimality: Unifying Discounted and Average Reward Reinforcement Learning

16 years 7 months ago

Download reference.kfupm.edu.sa

Research in reinforcement learning (RL) has thus far concentrated on two optimality criteria: the discounted framework, which has been very well-studied, and the average reward frame...

Sridhar Mahadevan

ICML
2006
IEEE

156 views - Machine Learning

Learning the structure of Factored Markov Decision Processes in reinforcement learning problems

16 years 7 months ago

Download animatlab.lip6.fr

Recent decision-theoretic planning algorithms are able to find optimal solutions in large problems, using Factored Markov Decision Processes (FMDPs). However, these algorithms need ...

Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille...
