Sciweavers

4544 search results - page 43 / 909

» Reinforcement Learning with Time

133

EWRL
2008

133views Machine Learning» more EWRL 2008»

Exploiting Additive Structure in Factored MDPs for Reinforcement Learning

15 years 7 months ago

Exploiting Additive Structure in Factored MDPs for Reinforcement Learning

Download ewrl08.futurs.inria.fr

Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille...

claim paper

Read More »

124

ML
2008
ACM

95views Machine Learning» more ML 2008»

Transfer in variable-reward hierarchical reinforcement learning

15 years 6 months ago

Transfer in variable-reward hierarchical reinforcement learning

Download web.engr.oregonstate.edu

Neville Mehta, Sriraam Natarajan, Prasad Tadepalli...

claim paper

Read More »

119

ML
2000
ACM

133views Machine Learning» more ML 2000»

Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms

15 years 5 months ago

Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms

Download www.cs.rutgers.edu

Satinder P. Singh, Tommi Jaakkola, Michael L. Litt...

claim paper

Read More »

280

ICAART
2010
INSTICC

509views Intelligent Agents» more ICAART 2010»

Complexity of Stochastic Branch and Bound Methods for Belief Tree Search in Bayesian Reinforcement Learning

16 years 3 months ago

Complexity of Stochastic Branch and Bound Methods for Belief Tree Search in Bayesian Reinforcement Learning

Download arxiv.org

There has been a lot of recent work on Bayesian methods for reinforcement learning exhibiting near-optimal online performance. The main obstacle facing such methods is that in most...

Christos Dimitrakakis

posted by olethros

Read More »

112

NIPS
2000

112views Information Technology» more NIPS 2000»

Balancing Multiple Sources of Reward in Reinforcement Learning

15 years 7 months ago

Balancing Multiple Sources of Reward in Reinforcement Learning

Download www.cc.gatech.edu

For many problems which would be natural for reinforcement learning, the reward signal is not a single scalar value but has multiple scalar components. Examples of such problems i...

Christian R. Shelton

claim paper

Read More »

« Prev « First page 43 / 909 Last » Next »