Sciweavers

272 search results - page 19 / 55
» Parallel Reinforcement Learning with Linear Function Approxi...
Sort
View
ECAI
2006
Springer
13 years 11 months ago
Least Squares SVM for Least Squares TD Learning
Abstract. We formulate the problem of least squares temporal difference learning (LSTD) in the framework of least squares SVM (LS-SVM). To cope with the large amount (and possible ...
Tobias Jung, Daniel Polani
PKDD
2009
Springer
152views Data Mining» more  PKDD 2009»
14 years 2 months ago
Feature Selection for Value Function Approximation Using Bayesian Model Selection
Abstract. Feature selection in reinforcement learning (RL), i.e. choosing basis functions such that useful approximations of the unkown value function can be obtained, is one of th...
Tobias Jung, Peter Stone
ICPR
2006
IEEE
14 years 8 months ago
Control Double Inverted Pendulum by Reinforcement Learning with Double CMAC Network
To accelerate the learning of reinforcement learning, many types of function approximation are used to represent state value. However function approximation reduces the accuracy o...
Siwei Luo, Yu Zheng, Ziang Lv
NIPS
1994
13 years 8 months ago
Reinforcement Learning with Soft State Aggregation
It is widely accepted that the use of more compact representations than lookup tables is crucial to scaling reinforcement learning (RL) algorithms to real-world problems. Unfortun...
Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...
ADCM
2006
74views more  ADCM 2006»
13 years 7 months ago
Linearly constrained reconstruction of functions by kernels with applications to machine learning
This paper investigates the approximation of multivariate functions from data via linear combinations of translates of a positive definite kernel from a reproducing kernel Hilbert...
Robert Schaback, J. Werner