Sciweavers

4544 search results - page 197 / 909
» Reinforcement Learning with Time
Sort
View
ICML
2004
IEEE
14 years 11 months ago
Multi-task feature and kernel selection for SVMs
We compute a common feature selection or kernel selection configuration for multiple support vector machines (SVMs) trained on different yet inter-related datasets. The method is ...
Tony Jebara
ICML
2003
IEEE
14 years 11 months ago
TD(0) Converges Provably Faster than the Residual Gradient Algorithm
In Reinforcement Learning (RL) there has been some experimental evidence that the residual gradient algorithm converges slower than the TD(0) algorithm. In this paper, we use the ...
Ralf Schoknecht, Artur Merke
IJCNN
2006
IEEE
14 years 4 months ago
Training Coordination Proxy Agents
— Delegating the coordination role to proxy agents can improve the overall outcome of the task at the expense of cognitive overload due to switching subtasks. Stability and commi...
Myriam Abramson, William Chao, Ranjeev Mittu
DEXA
2004
Springer
172views Database» more  DEXA 2004»
14 years 3 months ago
On the Automation of Similarity Information Maintenance in Flexible Query Answering Systems
This paper proposes a method for automatic maintaining the similarity information for a particular class of Flexible Query Answering Systems (FQAS). The paper describes the three m...
Balázs Csanád Csáji, Josef K&...
CEC
2003
IEEE
14 years 3 months ago
Real-time adaptation technique to real robots: an experiment with a humanoid robot
We introduce a technique that allows a real robot to execute real-time learning, in which GP and RL are integrated. In our former research, we showed the result of an experiment wi...
Shotaro Kamio, Hitoshi Iba