Sciweavers

4544 search results - page 79 / 909
» Reinforcement Learning with Time
Sort
View
127
Voted
DAGM
2006
Springer
15 years 6 months ago
Handling Camera Movement Constraints in Reinforcement Learning Based Active Object Recognition
In real world scenes, objects to be classified are usually not visible from every direction, since they are almost always positioned on some kind of opaque plane. When moving a cam...
Christian Derichs, Heinrich Niemann
152
Voted
ATAL
2005
Springer
15 years 8 months ago
Improving reinforcement learning function approximators via neuroevolution
Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...
Shimon Whiteson
179
Voted
ABIALS
2008
Springer
15 years 4 months ago
Multiscale Anticipatory Behavior by Hierarchical Reinforcement Learning
Abstract. In order to establish autonomous behavior for technical systems, the well known trade-off between reactive control and deliberative planning has to be considered. Within ...
Matthias Rungger, Hao Ding, Olaf Stursberg
112
Voted
NIPS
2001
15 years 4 months ago
Improvisation and Learning
This article presents a 2-phase computational learning model and application. As a demonstration, a system has been built, called CHIME for Computer Human Interacting Musical Enti...
Judy A. Franklin
103
Voted
GECCO
2005
Springer
155views Optimization» more  GECCO 2005»
15 years 8 months ago
Co-evolving recurrent neurons learn deep memory POMDPs
Recurrent neural networks are theoretically capable of learning complex temporal sequences, but training them through gradient-descent is too slow and unstable for practical use i...
Faustino J. Gomez, Jürgen Schmidhuber