Sciweavers

1233 search results - page 223 / 247
» Feudal Reinforcement Learning
Sort
View
ICML
2010
IEEE
13 years 9 months ago
Toward Off-Policy Learning Control with Function Approximation
We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...
Hamid Reza Maei, Csaba Szepesvári, Shalabh ...
IIE
2007
105views more  IIE 2007»
13 years 8 months ago
Student-Centered Support Systems to Sustain Logo-Like Learning
Conventional wisdom attributes the lack of effective technology use in classrooms to a shortage of professional development or poorly run professional development. At the same time...
Sylvia Martinez
ICRA
1994
IEEE
105views Robotics» more  ICRA 1994»
14 years 14 days ago
Harmonic Functions and Collision Probabilities
There is a close relationship between harmonic functions { which have recently been proposed for path planning { and hitting probabilities for random processes. The hitting probab...
Christopher I. Connolly
ROBOCUP
2000
Springer
130views Robotics» more  ROBOCUP 2000»
13 years 12 months ago
Improvement Continuous Valued Q-learning and Its Application to Vision Guided Behavior Acquisition
Q-learning, a most widely used reinforcement learning method, normally needs well-defined quantized state and action spaces to converge. This makes it difficult to be applied to re...
Yasutake Takahashi, Masanori Takeda, Minoru Asada
ESANN
2008
13 years 9 months ago
Improvement in Game Agent Control Using State-Action Value Scaling
The aim of this paper is to enhance the performance of a reinforcement learning game agent controller, within a dynamic game environment, through the retention of learned informati...
Leo Galway, Darryl Charles, Michaela M. Black