Sciweavers

2108 search results - page 235 / 422
» Tracking in Reinforcement Learning
Sort
View
ICRA
1994
IEEE
105views Robotics» more  ICRA 1994»
14 years 2 months ago
Harmonic Functions and Collision Probabilities
There is a close relationship between harmonic functions { which have recently been proposed for path planning { and hitting probabilities for random processes. The hitting probab...
Christopher I. Connolly
ROBOCUP
2000
Springer
130views Robotics» more  ROBOCUP 2000»
14 years 2 months ago
Improvement Continuous Valued Q-learning and Its Application to Vision Guided Behavior Acquisition
Q-learning, a most widely used reinforcement learning method, normally needs well-defined quantized state and action spaces to converge. This makes it difficult to be applied to re...
Yasutake Takahashi, Masanori Takeda, Minoru Asada
ESANN
2008
14 years 6 days ago
Improvement in Game Agent Control Using State-Action Value Scaling
The aim of this paper is to enhance the performance of a reinforcement learning game agent controller, within a dynamic game environment, through the retention of learned informati...
Leo Galway, Darryl Charles, Michaela M. Black
ESANN
2004
14 years 5 days ago
High-accuracy value-function approximation with neural networks applied to the acrobot
Several reinforcement-learning techniques have already been applied to the Acrobot control problem, using linear function approximators to estimate the value function. In this pape...
Rémi Coulom
NIPS
2004
14 years 4 days ago
Responding to Modalities with Different Latencies
Motor control depends on sensory feedback in multiple modalities with different latencies. In this paper we consider within the framework of reinforcement learning how different s...
Fredrik Bissmarck, Hiroyuki Nakahara, Kenji Doya, ...