Search Sciweavers | Sciweavers

1512 search results - page 192 / 303

» Qualitative reinforcement learning

144

click to vote

ICML
2003
IEEE

124views Machine Learning» more ICML 2003»

Exploration in Metric State Spaces

16 years 6 months ago

Download www.cis.upenn.edu

We present metric?? , a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows t...

Sham Kakade, Michael J. Kearns, John Langford

claim paper

Read More »

131

click to vote

ICML
2003
IEEE

146views Machine Learning» more ICML 2003»

TD(0) Converges Provably Faster than the Residual Gradient Algorithm

16 years 6 months ago

Download www.hpl.hp.com

In Reinforcement Learning (RL) there has been some experimental evidence that the residual gradient algorithm converges slower than the TD(0) algorithm. In this paper, we use the ...

Ralf Schoknecht, Artur Merke

claim paper

Read More »

136

click to vote

IJCNN
2006
IEEE

111views Neural Networks» more IJCNN 2006»

Training Coordination Proxy Agents

15 years 11 months ago

Download cs.itd.nrl.navy.mil

— Delegating the coordination role to proxy agents can improve the overall outcome of the task at the expense of cognitive overload due to switching subtasks. Stability and commi...

Myriam Abramson, William Chao, Ranjeev Mittu

claim paper

Read More »

142

click to vote

ISCAS
2006
IEEE

103views Hardware» more ISCAS 2006»

Towards autonomous adaptive behavior in a bio-inspired CNN-controlled robot

15 years 11 months ago

Download web.mit.edu

— This paper describes a general approach for the unsupervised learning of behaviors in a behavior-based robot. The key idea is to formalize a behavior produced by a Motor Map dr...

Paolo Arena, Luigi Fortuna, Mattia Frasca, Luca Pa...

claim paper

Read More »

156

click to vote

DEXA
2004
Springer

172views Database» more DEXA 2004»

On the Automation of Similarity Information Maintenance in Flexible Query Answering Systems

15 years 10 months ago

Download www.faw.uni-linz.ac.at

This paper proposes a method for automatic maintaining the similarity information for a particular class of Flexible Query Answering Systems (FQAS). The paper describes the three m...

Balázs Csanád Csáji, Josef K&...

claim paper

Read More »

« Prev « First page 192 / 303 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers