Search Sciweavers | Sciweavers

3412 search results - page 21 / 683

» Efficient Reinforcement Learning


ICML
2010
IEEE

222 views, Machine Learning

Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda

15 years 4 months ago

Download www.icml2010.org

Temporal difference (TD) algorithms are attractive for reinforcement learning due to their ease-of-implementation and use of "bootstrapped" return estimates to make effi...

Carlton Downey, Scott Sanner



CORR
2011
Springer

136 views, Education

Reinforcement Learning for Agents with Many Sensors and Actuators Acting in Categorizable Environments

14 years 10 months ago

Download www.aaai.org

In this paper, we confront the problem of applying reinforcement learning to agents that perceive the environment through many sensors and that can perform parallel actions using ...

Enric Celaya, Josep M. Porta



IEAAIE
2001
Springer

98 views, Artificial Intelligence

On the Relationship between Learning Capability and the Boltzmann-Formula

15 years 11 months ago

Download members.iif.hu

In this paper a combined use of reinforcement learning and simulated annealing is treated. Most of the simulated annealing methods suggest using heuristic temperature bounds as the...

Péter Stefán, Laszlo Monostori



ECML
2005
Springer

95 views, Machine Learning

Towards Finite-Sample Convergence of Direct Reinforcement Learning

16 years 7 days ago

Download www.cs.uiuc.edu

Abstract. While direct, model-free reinforcement learning often performs better than model-based approaches in practice, only the latter have yet supported theoretical guarantees f...

Shiau Hong Lim, Gerald DeJong



ATAL
2006
Springer

103 views, Intelligent Agents

Rule value reinforcement learning for cognitive agents

15 years 10 months ago

Download vega.soi.city.ac.uk

RVRL (Rule Value Reinforcement Learning) is a new algorithm which extends an existing learning framework that models the environment of a situated agent using a probabilistic rule...

Christopher Child, Kostas Stathis
