Search Sciweavers | Sciweavers

4544 search results - page 79 / 909

» Reinforcement Learning with Time

DAGM 2006, Springer (Image Processing)

Handling Camera Movement Constraints in Reinforcement Learning Based Active Object Recognition

15 years 10 months ago

Download www5.informatik.uni-erlangen.de

In real world scenes, objects to be classified are usually not visible from every direction, since they are almost always positioned on some kind of opaque plane. When moving a cam...

Christian Derichs, Heinrich Niemann

ATAL 2005, Springer (Intelligent Agents)

Improving reinforcement learning function approximators via neuroevolution

15 years 11 months ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson

ABIALS 2008, Springer (Artificial Intelligence)

Multiscale Anticipatory Behavior by Hierarchical Reinforcement Learning

15 years 8 months ago

Download axon.cs.byu.edu

In order to establish autonomous behavior for technical systems, the well-known trade-off between reactive control and deliberative planning has to be considered. Within ...

Matthias Rungger, Hao Ding, Olaf Stursberg

NIPS 2001 (Information Technology)

Improvisation and Learning

15 years 7 months ago

Download books.nips.cc

This article presents a 2-phase computational learning model and application. As a demonstration, a system has been built, called CHIME for Computer Human Interacting Musical Enti...

Judy A. Franklin

GECCO 2005, Springer (Optimization)

Co-evolving recurrent neurons learn deep memory POMDPs

15 years 11 months ago

Download www.idsia.ch

Recurrent neural networks are theoretically capable of learning complex temporal sequences, but training them through gradient-descent is too slow and unstable for practical use i...

Faustino J. Gomez, Jürgen Schmidhuber
