Sciweavers

1512 search results - page 192 / 303
» Qualitative reinforcement learning
Sort
View
ICML
2003
IEEE
14 years 10 months ago
Exploration in Metric State Spaces
We present metric?? , a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows t...
Sham Kakade, Michael J. Kearns, John Langford
ICML
2003
IEEE
14 years 10 months ago
TD(0) Converges Provably Faster than the Residual Gradient Algorithm
In Reinforcement Learning (RL) there has been some experimental evidence that the residual gradient algorithm converges slower than the TD(0) algorithm. In this paper, we use the ...
Ralf Schoknecht, Artur Merke
IJCNN
2006
IEEE
14 years 3 months ago
Training Coordination Proxy Agents
— Delegating the coordination role to proxy agents can improve the overall outcome of the task at the expense of cognitive overload due to switching subtasks. Stability and commi...
Myriam Abramson, William Chao, Ranjeev Mittu
ISCAS
2006
IEEE
103views Hardware» more  ISCAS 2006»
14 years 3 months ago
Towards autonomous adaptive behavior in a bio-inspired CNN-controlled robot
— This paper describes a general approach for the unsupervised learning of behaviors in a behavior-based robot. The key idea is to formalize a behavior produced by a Motor Map dr...
Paolo Arena, Luigi Fortuna, Mattia Frasca, Luca Pa...
DEXA
2004
Springer
172views Database» more  DEXA 2004»
14 years 2 months ago
On the Automation of Similarity Information Maintenance in Flexible Query Answering Systems
This paper proposes a method for automatic maintaining the similarity information for a particular class of Flexible Query Answering Systems (FQAS). The paper describes the three m...
Balázs Csanád Csáji, Josef K&...