Sciweavers

4544 search results - page 150 / 909
» Reinforcement Learning with Time
ECAL
2007
Springer
15 years 6 months ago
Genotype Reuse More Important than Genotype Size in Evolvability of Embodied Neural Networks
...Model of Embodiment on Abstract Systems: from Hierarchy to Heterarchy (Kohei Nakajima, Soya Shinkai, Takashi Ikegami); A Behavior-Based Model of the Hydra, Phylum Cnidaria (Malin Aktius...
Chad W. Seys, Randall D. Beer
AI
2002
Springer
15 years 2 months ago
Programming backgammon using self-teaching neural nets
TD-Gammon is a neural network that is able to teach itself to play backgammon solely by playing against itself and learning from the results. Starting from random initial play, TD...
Gerald Tesauro
COLT
2010
Springer
15 years 24 days ago
An Asymptotically Optimal Bandit Algorithm for Bounded Support Models
The multiarmed bandit problem is a typical example of the dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...
Junya Honda, Akimichi Takemura
ICML
2005
IEEE
16 years 3 months ago
Interactive learning of mappings from visual percepts to actions
We introduce flexible algorithms that can automatically learn mappings from images to actions by interacting with their environment. They work by introducing an image classifier i...
Justus H. Piater, Sébastien Jodogne
ATAL
2009
Springer
15 years 9 months ago
Learning of coordination: exploiting sparse interactions in multiagent systems
Creating coordinated multiagent policies in environments with uncertainty is a challenging problem, which can be greatly simplified if the coordination needs are known to be limi...
Francisco S. Melo, Manuela M. Veloso