Search Sciweavers | Sciweavers

» Reinforcement learning in a nutshell

CEC 2005, IEEE

98 views, Artificial Intelligence

XCS with computed prediction in continuous multistep environments

15 years 5 months ago

Download www.eskimo.com

We apply XCS with computed prediction (XCSF) to tackle multistep reinforcement learning problems involving continuous inputs. In essence we use XCSF as a method of generalized rein...

Pier Luca Lanzi, Daniele Loiacono, Stewart W. Wils...

ICML 2006, IEEE

142 views, Machine Learning

An intrinsic reward mechanism for efficient exploration

16 years 4 months ago

Download www-anw.cs.umass.edu

How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...

Özgür Simsek, Andrew G. Barto

AAAI 1996

191 views, Intelligent Agents

Evolution-Based Discovery of Hierarchical Behaviors

15 years 4 months ago

Download www.aaai.org

Procedural representations of control policies have two advantages when facing the scale-up problem in learning tasks. First they are implicit, with potential for inductive genera...

Justinian P. Rosca, Dana H. Ballard

SOCROB 2010

126 views, Robotics

Using the Interaction Rhythm as a Natural Reinforcement Signal for Social Robots: A Matter of Belief

15 years 1 months ago

Download fostsvn.uopnet.plymouth.ac.uk

In this paper, we present the results of a pilot study of a human-robot interaction experiment where the rhythm of the interaction is used as a reinforcement signal to le...

Antoine Hiolle, Lola Cañamero, Pierre Andry...

ICRA 2009, IEEE

143 views, Robotics

Least absolute policy iteration for robust value function approximation

15 years 10 months ago

Download sugiyama-www.cs.titech.ac.jp

Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efficiency. However, it tends to be sensitive to outliers...

Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...
