Search Sciweavers | Sciweavers

473 search results - page 47 / 95

» Optimal policy switching algorithms for reinforcement learni...

AAAI
2008

141 views, Intelligent Agents

Economic Hierarchical Q-Learning

15 years 8 months ago

Download www.aaai.org

Hierarchical state decompositions address the curse-of-dimensionality in Q-learning methods for reinforcement learning (RL) but can suffer from suboptimality. In addressing this, w...

Erik G. Schultink, Ruggiero Cavallo, David C. Park...

SIAMCO
2000

117 views

The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning

15 years 5 months ago

Download eprints.iisc.ernet.in

It is shown here that stability of the stochastic approximation algorithm is implied by the asymptotic stability of the origin for an associated ODE. This in turn implies convergen...

Vivek S. Borkar, Sean P. Meyn

AI
1999
Springer

264 views, Artificial Intelligence

Cooperative Behavior Acquisition for Mobile Robots in Dynamically Changing Real Worlds Via Vision-Based Reinforcement Learning a

15 years 5 months ago

Download www.mendeley.com

In this paper, we first discuss the meaning of physical embodiment and the complexity of the environment in the context of multi-agent learning. We then propose a vision-based rei...

Minoru Asada, Eiji Uchibe, Koh Hosoda

AAAI
2006

161 views, Intelligent Agents

Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning

15 years 7 months ago

Download staff.science.uva.nl

Reinforcement learning problems are commonly tackled with temporal difference methods, which attempt to estimate the agent's optimal value function. In most real-world proble...

Shimon Whiteson, Peter Stone

AAAI
2012

205 views, Intelligent Agents

Kernel-Based Reinforcement Learning on Representative States

13 years 8 months ago

Download www.bkveton.com

Markov decision processes (MDPs) are an established framework for solving sequential decision-making problems under uncertainty. In this work, we propose a new method for batchmod...

Branislav Kveton, Georgios Theocharous
