Sciweavers

1235 search results - page 180 / 247
» ABC Reinforcement Learning
Sort
View
IJCNN
2006
IEEE
14 years 4 months ago
Training Coordination Proxy Agents
— Delegating the coordination role to proxy agents can improve the overall outcome of the task at the expense of cognitive overload due to switching subtasks. Stability and commi...
Myriam Abramson, William Chao, Ranjeev Mittu
ISCAS
2006
IEEE
103views Hardware» more  ISCAS 2006»
14 years 4 months ago
Towards autonomous adaptive behavior in a bio-inspired CNN-controlled robot
— This paper describes a general approach for the unsupervised learning of behaviors in a behavior-based robot. The key idea is to formalize a behavior produced by a Motor Map dr...
Paolo Arena, Luigi Fortuna, Mattia Frasca, Luca Pa...
DEXA
2004
Springer
172views Database» more  DEXA 2004»
14 years 3 months ago
On the Automation of Similarity Information Maintenance in Flexible Query Answering Systems
This paper proposes a method for automatic maintaining the similarity information for a particular class of Flexible Query Answering Systems (FQAS). The paper describes the three m...
Balázs Csanád Csáji, Josef K&...
CEC
2003
IEEE
14 years 3 months ago
Real-time adaptation technique to real robots: an experiment with a humanoid robot
We introduce a technique that allows a real robot to execute real-time learning, in which GP and RL are integrated. In our former research, we showed the result of an experiment wi...
Shotaro Kamio, Hitoshi Iba
EWRL
2008
13 years 11 months ago
Markov Decision Processes with Arbitrary Reward Processes
Abstract. We consider a control problem where the decision maker interacts with a standard Markov decision process with the exception that the reward functions vary arbitrarily ove...
Jia Yuan Yu, Shie Mannor, Nahum Shimkin