Sciweavers

» Reinforcement Learning with Time
ECAL
2007
Springer
14 years 2 months ago
Genotype Reuse More Important than Genotype Size in Evolvability of Embodied Neural Networks
odel of Embodiment on Abstract Systems: from Hierarchy to Heterarchy. Kohei Nakajima, Soya Shinkai, Takashi Ikegami. A Behavior-Based Model of the Hydra, Phylum Cnidaria. Malin Aktius...
Chad W. Seys, Randall D. Beer
AI
2002
Springer
13 years 10 months ago
Programming backgammon using self-teaching neural nets
TD-Gammon is a neural network that is able to teach itself to play backgammon solely by playing against itself and learning from the results. Starting from random initial play, TD...
Gerald Tesauro
COLT
2010
Springer
13 years 8 months ago
An Asymptotically Optimal Bandit Algorithm for Bounded Support Models
The multiarmed bandit problem is a typical example of the dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...
Junya Honda, Akimichi Takemura
ICML
2005
IEEE
14 years 11 months ago
Interactive learning of mappings from visual percepts to actions
We introduce flexible algorithms that can automatically learn mappings from images to actions by interacting with their environment. They work by introducing an image classifier i...
Justus H. Piater, Sébastien Jodogne
ATAL
2009
Springer
14 years 4 months ago
Learning of coordination: exploiting sparse interactions in multiagent systems
Creating coordinated multiagent policies in environments with uncertainty is a challenging problem, which can be greatly simplified if the coordination needs are known to be limi...
Francisco S. Melo, Manuela M. Veloso