Search Sciweavers | Sciweavers

166 search results - page 18 / 34

» Safe exploration for reinforcement learning

ICML
2002
IEEE

138 views, Machine Learning

Reinforcement Learning and Shaping: Encouraging Intended Behaviors

16 years 6 months ago

Download www.grappa.univ-lille3.fr

We explore dynamic shaping to integrate our prior beliefs of the final policy into a conventional reinforcement learning system. Shaping provides a positive or negative artificial...

Adam Laud, Gerald DeJong

ICML
2007
IEEE

172 views, Machine Learning

Conditional random fields for multi-agent reinforcement learning

16 years 6 months ago

Download www.machinelearning.org

Conditional random fields (CRFs) are graphical models for modeling the probability of labels given the observations. They have traditionally been trained using a set of obser...

Xinhua Zhang, Douglas Aberdeen, S. V. N. Vishwanat...

ICML
2003
IEEE

104 views, Machine Learning

The Influence of Reward on the Speed of Reinforcement Learning: An Analysis of Shaping

15 years 11 months ago

Download www.hpl.hp.com

Shaping can be an effective method for improving the learning rate in reinforcement systems. Previously, shaping has been heuristically motivated and implemented. We provide a for...

Adam Laud, Gerald DeJong

ATAL
2010
Springer

115 views, Intelligent Agents

Self-organization for coordinating decentralized reinforcement learning

15 years 7 months ago

Download www.cs.umass.edu

Decentralized reinforcement learning (DRL) has been applied to a number of distributed applications. However, one of the main challenges faced by DRL is its convergence. Previous ...

Chongjie Zhang, Victor R. Lesser, Sherief Abdallah

CORR
2006
Springer

101 views, Education

Metric State Space Reinforcement Learning for a Vision-Capable Mobile Robot

15 years 5 months ago

Download www.idsia.ch

We address the problem of autonomously learning controllers for vision-capable mobile robots. We extend McCallum's (1995) Nearest-Sequence Memory algorithm to allow for genera...

Viktor Zhumatiy, Faustino J. Gomez, Marcus Hutter,...
