Sciweavers

» Reinforcement Learning in MirrorBot
ML
2000
ACM
13 years 8 months ago
Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms
Satinder P. Singh, Tommi Jaakkola, Michael L. Litt...
NIPS
2000
13 years 10 months ago
Balancing Multiple Sources of Reward in Reinforcement Learning
For many problems which would be natural for reinforcement learning, the reward signal is not a single scalar value but has multiple scalar components. Examples of such problems i...
Christian R. Shelton
GECCO
2006
Springer
14 years 6 days ago
On-line evolutionary computation for reinforcement learning in stochastic domains
In reinforcement learning, an agent interacting with its environment strives to learn a policy that specifies, for each state it may encounter, what action to take. Evolutionary c...
Shimon Whiteson, Peter Stone
ICMLA
2003
13 years 10 months ago
Reinforcement Learning Task Clustering
This work represents the first step towards a task library system in the reinforcement learning domain. Task libraries could be useful in speeding up the learning of new tasks th...
James L. Carroll, Todd S. Peterson, Kevin D. Seppi
NIPS
1992
13 years 9 months ago
Using Aperiodic Reinforcement for Directed Self-Organization During Development
We present a local learning rule in which Hebbian learning is conditional on an incorrect prediction of a reinforcement signal. We propose a biological interpretation of such a fr...
P. Read Montague, Peter Dayan, Steven J. Nowlan, T...