Search Sciweavers | Sciweavers

1233 search results - page 10 / 247

» Reinforcement Learning in MirrorBot

ML
2000
ACM

133 views · Machine Learning

Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms

15 years 2 months ago

Download www.cs.rutgers.edu

Satinder P. Singh, Tommi Jaakkola, Michael L. Litt...

NIPS
2000

112 views · Information Technology

Balancing Multiple Sources of Reward in Reinforcement Learning

15 years 4 months ago

Download www.cc.gatech.edu

For many problems which would be natural for reinforcement learning, the reward signal is not a single scalar value but has multiple scalar components. Examples of such problems i...

Christian R. Shelton

GECCO
2006
Springer

133 views · Optimization

On-line evolutionary computation for reinforcement learning in stochastic domains

15 years 6 months ago

Download userweb.cs.utexas.edu

In reinforcement learning, an agent interacting with its environment strives to learn a policy that specifies, for each state it may encounter, what action to take. Evolutionary c...

Shimon Whiteson, Peter Stone

ICMLA
2003

169 views · Machine Learning

Reinforcement Learning Task Clustering

15 years 4 months ago

Download james.jlcarroll.net

This work represents the first step towards a task library system in the reinforcement learning domain. Task libraries could be useful in speeding up the learning of new tasks th...

James L. Carroll, Todd S. Peterson, Kevin D. Seppi

NIPS
1992

84 views · Information Technology

Using Aperiodic Reinforcement for Directed Self-Organization During Development

15 years 4 months ago

Download www.hnl.bcm.tmc.edu

We present a local learning rule in which Hebbian learning is conditional on an incorrect prediction of a reinforcement signal. We propose a biological interpretation of such a fr...

P. Read Montague, Peter Dayan, Steven J. Nowlan, T...
