Search Sciweavers | Sciweavers

584 search results - page 22 / 117

» Reinforcement Learning Task Clustering

click to vote

GECCO
2005
Springer

155views Optimization» more GECCO 2005»

Co-evolving recurrent neurons learn deep memory POMDPs

15 years 8 months ago

Download www.idsia.ch

Recurrent neural networks are theoretically capable of learning complex temporal sequences, but training them through gradient-descent is too slow and unstable for practical use i...

Faustino J. Gomez, Jürgen Schmidhuber

claim paper

Read More »

118

Voted

AR
2007

105views more AR 2007»

Reinforcement learning of a continuous motor sequence with hidden states

15 years 2 months ago

Download www.bdc.brain.riken.go.jp

—Reinforcement learning is the scheme for unsupervised learning in which robots are expected to acquire behavior skills through self-explorations based on reward signals. There a...

Hiroaki Arie, Tetsuya Ogata, Jun Tani, Shigeki Sug...

claim paper

Read More »

126

click to vote

KCAP
2009
ACM

171views Information Technology» more KCAP 2009»

Interactively shaping agents via human reinforcement: the TAMER framework

15 years 9 months ago

Download userweb.cs.utexas.edu

As computational learning agents move into domains that incur real costs (e.g., autonomous driving or ﬁnancial investment), it will be necessary to learn good policies without n...

W. Bradley Knox, Peter Stone

claim paper

Read More »

179

Voted

AGENTS
2001
Springer

247views Security Privacy» more AGENTS 2001»

Hierarchical multi-agent reinforcement learning

15 years 6 months ago

Download www-anw.cs.umass.edu

In this paper, we investigate the use of hierarchical reinforcement learning (HRL) to speed up the acquisition of cooperative multi-agent tasks. We introduce a hierarchical multi-a...

Rajbala Makar, Sridhar Mahadevan, Mohammad Ghavamz...

claim paper

Read More »

122

click to vote

GECCO
2006
Springer

208views Optimization» more GECCO 2006»

Comparing evolutionary and temporal difference methods in a reinforcement learning domain

15 years 6 months ago

Download www.cs.bham.ac.uk

Both genetic algorithms (GAs) and temporal difference (TD) methods have proven effective at solving reinforcement learning (RL) problems. However, since few rigorous empirical com...

Matthew E. Taylor, Shimon Whiteson, Peter Stone

claim paper

Read More »

« Prev « First page 22 / 117 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers