Sciweavers

704 search results - page 10 / 141
» Learning the Ideal Evaluation Function
Sort
View
ICML
2010
IEEE
13 years 12 months ago
Toward Off-Policy Learning Control with Function Approximation
We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...
Hamid Reza Maei, Csaba Szepesvári, Shalabh ...
DAS
2010
Springer
14 years 2 months ago
Towards more effective distance functions for word image matching
Matching word images has many applications in document recognition and retrieval systems. Dynamic Time Warping (DTW) is popularly used to estimate the similarity between word imag...
Raman Jain, C. V. Jawahar
ICCBR
2005
Springer
14 years 4 months ago
CBR for State Value Function Approximation in Reinforcement Learning
CBR is one of the techniques that can be applied to the task of approximating a function over high-dimensional, continuous spaces. In Reinforcement Learning systems a learning agen...
Thomas Gabel, Martin A. Riedmiller
HICSS
2003
IEEE
120views Biometrics» more  HICSS 2003»
14 years 4 months ago
Evaluating On-line Learning Platforms: a Case Study
Our “information-oriented” society shows an increasing exigency of life-long learning. In such framework, online Learning is becoming an important tool to allow the flexibilit...
Francesco Colace, Massimo De Santo, Mario Vento
ATAL
2010
Springer
14 years 2 hour ago
Basis function construction for hierarchical reinforcement learning
This paper introduces an approach to automatic basis function construction for Hierarchical Reinforcement Learning (HRL) tasks. We describe some considerations that arise when con...
Sarah Osentoski, Sridhar Mahadevan