Search Sciweavers | Sciweavers

536 search results - page 41 / 108

» Residual Algorithms: Reinforcement Learning with Function Ap...

155

Voted

ICONIP
2009

107views Information Technology» more ICONIP 2009»

Tracking in Reinforcement Learning

15 years 4 months ago

Download www.metz.supelec.fr

Reinforcement learning induces non-stationarity at several levels. Adaptation to non-stationary environments is of course a desired feature of a fair RL algorithm. Yet, even if the...

Matthieu Geist, Olivier Pietquin, Gabriel Fricout

claim paper

Read More »

233

click to vote

ICML
1995
IEEE

196views Machine Learning» more ICML 1995»

Ant-Q: A Reinforcement Learning Approach to the Traveling Salesman Problem

16 years 7 months ago

Download www.idsia.ch

In this paper we introduce Ant-Q, a family of algorithms which present many similarities with Q-learning (Watkins, 1989), and which we apply to the solution of symmetric and asymm...

Luca Maria Gambardella, Marco Dorigo

claim paper

Read More »

190

click to vote

ICAC
2006
IEEE

112views Applied Computing» more ICAC 2006»

A Hybrid Reinforcement Learning Approach to Autonomic Resource Allocation

16 years 1 months ago

Download userweb.cs.utexas.edu

— Reinforcement Learning (RL) provides a promising new approach to systems performance management that differs radically from standard queuing-theoretic approaches making use of ...

Gerald Tesauro, Nicholas K. Jong, Rajarshi Das, Mo...

claim paper

Read More »

174

click to vote

ICML
2005
IEEE

188views Machine Learning» more ICML 2005»

High speed obstacle avoidance using monocular vision and reinforcement learning

16 years 7 months ago

Download ai.stanford.edu

We consider the task of driving a remote control car at high speeds through unstructured outdoor environments. We present an approach in which supervised learning is first used to...

Jeff Michels, Ashutosh Saxena, Andrew Y. Ng

claim paper

Read More »

185

click to vote

KCAP
2009
ACM

171views Information Technology» more KCAP 2009»

Interactively shaping agents via human reinforcement: the TAMER framework

16 years 1 months ago

Download userweb.cs.utexas.edu

As computational learning agents move into domains that incur real costs (e.g., autonomous driving or ﬁnancial investment), it will be necessary to learn good policies without n...

W. Bradley Knox, Peter Stone

claim paper

Read More »

« Prev « First page 41 / 108 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers