Sciweavers

2108 search results - page 85 / 422
» Tracking in Reinforcement Learning
Sort
View
ICONIP
2007
13 years 11 months ago
Finding Exploratory Rewards by Embodied Evolution and Constrained Reinforcement Learning in the Cyber Rodents
The aim of the Cyber Rodent project [1] is to elucidate the origin of our reward and affective systems by building artificial agents that share the natural biological constraints...
Eiji Uchibe, Kenji Doya
IJCAI
2007
13 years 11 months ago
Direct Code Access in Self-Organizing Neural Networks for Reinforcement Learning
TD-FALCON is a self-organizing neural network that incorporates Temporal Difference (TD) methods for reinforcement learning. Despite the advantages of fast and stable learning, TD...
Ah-Hwee Tan
EACL
2006
ACL Anthology
13 years 11 months ago
Using Reinforcement Learning to Build a Better Model of Dialogue State
Given the growing complexity of tasks that spoken dialogue systems are trying to handle, Reinforcement Learning (RL) has been increasingly used as a way of automatically learning ...
Joel R. Tetreault, Diane J. Litman
NIPS
2004
13 years 11 months ago
Intrinsically Motivated Reinforcement Learning
Psychologists call behavior intrinsically motivated when it is engaged in for its own sake rather than as a step toward solving a specific problem of clear practical value. But wh...
Satinder P. Singh, Andrew G. Barto, Nuttapong Chen...
NIPS
2000
13 years 11 months ago
Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task
The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...
Brian Sallans, Geoffrey E. Hinton