Sciweavers

606 search results - page 59 / 122
» Least-Commitment Action Selection
Sort
View
IVA
2010
Springer
13 years 6 months ago
Smart Events and Primed Agents
We describe a new organization for virtual human responses to dynamically occurring events. In our approach behavioral responses are enumerated in the representation of the event i...
Catherine Stocker, Libo Sun, Pengfei Huang, Wenhu ...
ICRA
2010
IEEE
117views Robotics» more  ICRA 2010»
13 years 6 months ago
Learning reliable and efficient navigation with a humanoid
Reliable and efficient navigation with a humanoid robot is a difficult task. First, the motion commands are executed rather inaccurately due to backlash in the joints or foot slipp...
Stefan Oßwald, Armin Hornung, Maren Bennewit...
EUROCAST
2007
Springer
182views Hardware» more  EUROCAST 2007»
14 years 2 months ago
A k-NN Based Perception Scheme for Reinforcement Learning
Abstract a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...
José Antonio Martin H., Javier de Lope Asia...
IWANN
1999
Springer
14 years 11 days ago
Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning
To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...
R. Matthew Kretchmar, Charles W. Anderson
IROS
2008
IEEE
125views Robotics» more  IROS 2008»
14 years 2 months ago
Dynamic correlation matrix based multi-Q learning for a multi-robot system
—Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selecti...
Hongliang Guo, Yan Meng