Search Sciweavers | Sciweavers

1235 search results - page 198 / 247

» Reinforcement learning in a nutshell

ICRA
1994
IEEE

Harmonic Functions and Collision Probabilities

Download www.cs.cmu.edu

There is a close relationship between harmonic functions, which have recently been proposed for path planning, and hitting probabilities for random processes. The hitting probab...

Christopher I. Connolly

ROBOCUP
2000
Springer

Improvement Continuous Valued Q-learning and Its Application to Vision Guided Behavior Acquisition

Download www.er.ams.eng.osaka-u.ac.jp

Q-learning, a most widely used reinforcement learning method, normally needs well-defined quantized state and action spaces to converge. This makes it difficult to be applied to re...

Yasutake Takahashi, Masanori Takeda, Minoru Asada

ESANN
2008

Improvement in Game Agent Control Using State-Action Value Scaling

Download www.dice.ucl.ac.be

The aim of this paper is to enhance the performance of a reinforcement learning game agent controller, within a dynamic game environment, through the retention of learned informati...

Leo Galway, Darryl Charles, Michaela M. Black

ESANN
2004

High-accuracy value-function approximation with neural networks applied to the acrobot

Download remi.coulom.free.fr

Several reinforcement-learning techniques have already been applied to the Acrobot control problem, using linear function approximators to estimate the value function. In this pape...

Rémi Coulom

NIPS
2004

Responding to Modalities with Different Latencies

Download books.nips.cc

Motor control depends on sensory feedback in multiple modalities with different latencies. In this paper we consider within the framework of reinforcement learning how different s...

Fredrik Bissmarck, Hiroyuki Nakahara, Kenji Doya, ...
