Sciweavers

1234 search results - page 33 / 247

» Multi-criteria Reinforcement Learning

131

ICML
1997
IEEE

135views Machine Learning» more ICML 1997»

Expected Mistake Bound Model for On-Line Reinforcement Learning

16 years 6 months ago

Expected Mistake Bound Model for On-Line Reinforcement Learning

Download www.cs.ualberta.ca

Claude-Nicolas Fiechter

claim paper

Read More »

129

ECML
1998
Springer

85views Machine Learning» more ECML 1998»

Theoretical Results on Reinforcement Learning with Temporally Abstract Options

15 years 10 months ago

Theoretical Results on Reinforcement Learning with Temporally Abstract Options

Download webdocs.cs.ualberta.ca

Doina Precup, Richard S. Sutton, Satinder P. Singh

claim paper

Read More »

133

EWRL
2008

133views Machine Learning» more EWRL 2008»

Exploiting Additive Structure in Factored MDPs for Reinforcement Learning

15 years 7 months ago

Exploiting Additive Structure in Factored MDPs for Reinforcement Learning

Download ewrl08.futurs.inria.fr

Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille...

claim paper

Read More »

124

ML
2008
ACM

95views Machine Learning» more ML 2008»

Transfer in variable-reward hierarchical reinforcement learning

15 years 5 months ago

Transfer in variable-reward hierarchical reinforcement learning

Download web.engr.oregonstate.edu

Neville Mehta, Sriraam Natarajan, Prasad Tadepalli...

claim paper

Read More »

277

ICAART
2010
INSTICC

509views Intelligent Agents» more ICAART 2010»

Complexity of Stochastic Branch and Bound Methods for Belief Tree Search in Bayesian Reinforcement Learning

16 years 3 months ago

Complexity of Stochastic Branch and Bound Methods for Belief Tree Search in Bayesian Reinforcement Learning

Download arxiv.org

There has been a lot of recent work on Bayesian methods for reinforcement learning exhibiting near-optimal online performance. The main obstacle facing such methods is that in most...

Christos Dimitrakakis

posted by olethros

Read More »

« Prev « First page 33 / 247 Last » Next »