Search Sciweavers | Sciweavers

92 search results - page 6 / 19

» A General Convergence Method for Reinforcement Learning in t...

click to vote

ICML
1997
IEEE

194views Machine Learning» more ICML 1997»

Hierarchical Explanation-Based Reinforcement Learning

14 years 7 months ago

Download reference.kfupm.edu.sa

Explanation-Based Reinforcement Learning (EBRL) was introduced by Dietterich and Flann as a way of combining the ability of Reinforcement Learning (RL) to learn optimal plans with...

Prasad Tadepalli, Thomas G. Dietterich

claim paper

Read More »

click to vote

ICML
2007
IEEE

180views Machine Learning» more ICML 2007»

Tracking value function dynamics to improve reinforcement learning with piecewise linear function approximation

14 years 7 months ago

Download www.machinelearning.org

Reinforcement learning algorithms can become unstable when combined with linear function approximation. Algorithms that minimize the mean-square Bellman error are guaranteed to co...

Chee Wee Phua, Robert Fitch

claim paper

Read More »

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

13 years 8 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

click to vote

AI
2002
Springer

171views Artificial Intelligence» more AI 2002»

Multiagent learning using a variable learning rate

13 years 6 months ago

Download www.cs.cmu.edu

Learning to act in a multiagent environment is a difficult problem since the normal definition of an optimal policy no longer applies. The optimal policy at any moment depends on ...

Michael H. Bowling, Manuela M. Veloso

claim paper

Read More »

click to vote

ATAL
2008
Springer

176views Intelligent Agents» more ATAL 2008»

Analysis of an evolutionary reinforcement learning method in a multiagent domain

13 years 8 months ago

Download www.aamas-conference.org

Many multiagent problems comprise subtasks which can be considered as reinforcement learning (RL) problems. In addition to classical temporal difference methods, evolutionary algo...

Jan Hendrik Metzen, Mark Edgington, Yohannes Kassa...

claim paper

Read More »

« Prev « First page 6 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers