Sciweavers

1234 search results - page 33 / 247
» Multi-criteria Reinforcement Learning
Sort
View
113
Voted
ICML
1997
IEEE
16 years 4 months ago
Expected Mistake Bound Model for On-Line Reinforcement Learning
Claude-Nicolas Fiechter
ECML
1998
Springer
15 years 8 months ago
Theoretical Results on Reinforcement Learning with Temporally Abstract Options
Doina Precup, Richard S. Sutton, Satinder P. Singh
EWRL
2008
15 years 5 months ago
Exploiting Additive Structure in Factored MDPs for Reinforcement Learning
Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille...
102
Voted
ML
2008
ACM
15 years 3 months ago
Transfer in variable-reward hierarchical reinforcement learning
Neville Mehta, Sriraam Natarajan, Prasad Tadepalli...
ICAART
2010
INSTICC
16 years 1 months ago
Complexity of Stochastic Branch and Bound Methods for Belief Tree Search in Bayesian Reinforcement Learning
There has been a lot of recent work on Bayesian methods for reinforcement learning exhibiting near-optimal online performance. The main obstacle facing such methods is that in most...
Christos Dimitrakakis