Search Sciweavers | Sciweavers

4544 search results - page 197 / 909

» Reinforcement Learning with Time

204

Voted

ICML
2004
IEEE

163views Machine Learning» more ICML 2004»

Multi-task feature and kernel selection for SVMs

16 years 8 months ago

Download www1.cs.columbia.edu

We compute a common feature selection or kernel selection configuration for multiple support vector machines (SVMs) trained on different yet inter-related datasets. The method is ...

Tony Jebara

claim paper

Read More »

169

click to vote

ICML
2003
IEEE

146views Machine Learning» more ICML 2003»

TD(0) Converges Provably Faster than the Residual Gradient Algorithm

16 years 8 months ago

Download www.hpl.hp.com

In Reinforcement Learning (RL) there has been some experimental evidence that the residual gradient algorithm converges slower than the TD(0) algorithm. In this paper, we use the ...

Ralf Schoknecht, Artur Merke

claim paper

Read More »

170

click to vote

IJCNN
2006
IEEE

111views Neural Networks» more IJCNN 2006»

Training Coordination Proxy Agents

16 years 1 months ago

Download cs.itd.nrl.navy.mil

— Delegating the coordination role to proxy agents can improve the overall outcome of the task at the expense of cognitive overload due to switching subtasks. Stability and commi...

Myriam Abramson, William Chao, Ranjeev Mittu

claim paper

Read More »

215

Voted

DEXA
2004
Springer

172views Database» more DEXA 2004»

On the Automation of Similarity Information Maintenance in Flexible Query Answering Systems

16 years 26 days ago

Download www.faw.uni-linz.ac.at

This paper proposes a method for automatic maintaining the similarity information for a particular class of Flexible Query Answering Systems (FQAS). The paper describes the three m...

Balázs Csanád Csáji, Josef K&...

claim paper

Read More »

177

click to vote

CEC
2003
IEEE

102views Artificial Intelligence» more CEC 2003»

Real-time adaptation technique to real robots: an experiment with a humanoid robot

16 years 24 days ago

Download www.iba.t.u-tokyo.ac.jp

We introduce a technique that allows a real robot to execute real-time learning, in which GP and RL are integrated. In our former research, we showed the result of an experiment wi...

Shotaro Kamio, Hitoshi Iba

claim paper

Read More »

« Prev « First page 197 / 909 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers