Search Sciweavers | Sciweavers

473 search results - page 13 / 95

» Optimal policy switching algorithms for reinforcement learni...

GECCO
2009
Springer

124 views, Optimization

Reinforcement learning for games: failures and successes

14 years 9 days ago

Download www.gm.fh-koeln.de

We apply CMA-ES, an evolution strategy with covariance matrix adaptation, and TDL (Temporal Difference Learning) to reinforcement learning tasks. In both cases these algorithms se...

Wolfgang Konen, Thomas Bartz-Beielstein


ICML
2006
IEEE

142 views, Machine Learning

An intrinsic reward mechanism for efficient exploration

14 years 8 months ago

Download www-anw.cs.umass.edu

How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...

Özgür Simsek, Andrew G. Barto


ICML
2000
IEEE

155 views, Machine Learning

Combining Reinforcement Learning with a Local Control Algorithm

14 years 8 months ago

Download www-anw.cs.umass.edu

We explore combining reinforcement learning with a hand-crafted local controller in a manner suggested by the chaotic control algorithm of Vincent, Schmitt and Vincent (1994). A c...

Andrew G. Barto, Jette Randløv, Michael T. ...


ATAL
2006
Springer

142 views, Intelligent Agents

Probabilistic policy reuse in a reinforcement learning agent

13 years 11 months ago

Download www.cs.cmu.edu

We contribute Policy Reuse as a technique to improve a reinforcement learning agent with guidance from past learned similar policies. Our method relies on using the past policies ...

Fernando Fernández, Manuela M. Veloso


ICML
2007
IEEE

172 views, Machine Learning

Conditional random fields for multi-agent reinforcement learning

14 years 8 months ago

Download www.machinelearning.org

Conditional random fields (CRFs) are graphical models of the probability of labels given the observations. They have traditionally been trained using a set of obser...

Xinhua Zhang, Douglas Aberdeen, S. V. N. Vishwanat...
