Search Sciweavers | Sciweavers

178 search results - page 18 / 36

» Probabilistic policy reuse in a reinforcement learning agent

ESANN
2008

278 views, Neural Networks

Learning to play Tetris applying reinforcement learning methods

13 years 10 months ago

Download www.dice.ucl.ac.be

In this paper, the application of reinforcement learning to Tetris is investigated; in particular, the idea of temporal difference learning is applied to estimate the state value funct...

Alexander Groß, Jan Friedland, Friedhelm Sch...

JMLR
2002

125 views

Lyapunov Design for Safe Reinforcement Learning

13 years 8 months ago

Download www-anw.cs.umass.edu

Lyapunov design methods are used widely in control engineering to design controllers that achieve qualitative objectives, such as stabilizing a system or maintaining a system'...

Theodore J. Perkins, Andrew G. Barto

AGENTS
2001
Springer

247 views, Security Privacy

Hierarchical multi-agent reinforcement learning

14 years 1 month ago

Download www-anw.cs.umass.edu

In this paper, we investigate the use of hierarchical reinforcement learning (HRL) to speed up the acquisition of cooperative multi-agent tasks. We introduce a hierarchical multi-a...

Rajbala Makar, Sridhar Mahadevan, Mohammad Ghavamz...

JAIR
2011

144 views

Non-Deterministic Policies in Markovian Decision Processes

13 years 3 months ago

Download www.jair.org

Markovian processes have long been used to model stochastic environments. Reinforcement learning has emerged as a framework to solve sequential planning and decision-making proble...

Mahdi Milani Fard, Joelle Pineau

ICONIP
2009

107 views, Information Technology

Tracking in Reinforcement Learning

13 years 6 months ago

Download www.metz.supelec.fr

Reinforcement learning induces non-stationarity at several levels. Adaptation to non-stationary environments is of course a desired feature of a fair RL algorithm. Yet, even if the...

Matthieu Geist, Olivier Pietquin, Gabriel Fricout
