Sciweavers

Search results: Reinforcement Learning with Time
EWRL
2008
13 years 9 months ago
Exploiting Additive Structure in Factored MDPs for Reinforcement Learning
Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille...
ML
2008
ACM
13 years 7 months ago
Transfer in variable-reward hierarchical reinforcement learning
Neville Mehta, Sriraam Natarajan, Prasad Tadepalli...
ML
2000
ACM
13 years 7 months ago
Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms
Satinder P. Singh, Tommi Jaakkola, Michael L. Litt...
ICAART
2010
INSTICC
14 years 4 months ago
Complexity of Stochastic Branch and Bound Methods for Belief Tree Search in Bayesian Reinforcement Learning
There has been a lot of recent work on Bayesian methods for reinforcement learning exhibiting near-optimal online performance. The main obstacle facing such methods is that in most...
Christos Dimitrakakis
NIPS
2000
13 years 9 months ago
Balancing Multiple Sources of Reward in Reinforcement Learning
For many problems which would be natural for reinforcement learning, the reward signal is not a single scalar value but has multiple scalar components. Examples of such problems i...
Christian R. Shelton