Experiments in Value Function Approximation with Sparse Support Vector Regression



Abstract. We present first experiments using Support Vector Regression (SVR) as a function approximator for an on-line, Sarsa-like reinforcement learner. To overcome the batch nature of SVR, two ideas are employed. The first is sparse greedy approximation: the data is projected onto the subspace spanned by only a small subset of the original data (in feature space). This subset can be built up in an on-line fashion. Second, we use the sparsified data to solve a reduced quadratic problem, where the number of variables is independent of the total number of training samples seen. The feasibility of this approach is demonstrated on two common toy problems.
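
The on-line subset construction described in the abstract can be illustrated with a small sketch. The abstract gives no algorithmic details, so everything here is an assumption rather than the authors' method: the sketch uses an approximate-linear-dependence test (a common way to sparsify kernel data), a Gaussian RBF kernel, and hypothetical names (`SparseDictionary`, `rbf_kernel`, `tol`, `gamma`). A new sample joins the subset only if its squared distance in feature space to the span of the current subset exceeds `tol`.

```python
import numpy as np

def rbf_kernel(x, y, gamma=1.0):
    """Gaussian RBF kernel; the kernel choice and gamma are assumptions."""
    d = np.asarray(x, dtype=float) - np.asarray(y, dtype=float)
    return float(np.exp(-gamma * np.dot(d, d)))

class SparseDictionary:
    """Hypothetical on-line subset ("dictionary") of samples in feature space."""

    def __init__(self, kernel=rbf_kernel, tol=1e-2):
        self.kernel = kernel
        self.tol = tol        # threshold on the squared projection residual
        self.points = []      # samples currently in the subset
        self.K_inv = None     # inverse kernel matrix of the subset

    def residual(self, x):
        """Return (squared projection residual, projection coefficients)."""
        ktt = self.kernel(x, x)
        if not self.points:
            return ktt, np.zeros(0)
        k = np.array([self.kernel(p, x) for p in self.points])
        a = self.K_inv @ k    # coefficients of the projection onto the span
        return ktt - float(k @ a), a

    def update(self, x):
        """Add x if it is not well approximated; return True if it was added."""
        delta, a = self.residual(x)
        if delta <= self.tol:
            return False      # x is (almost) in the span: keep the subset as is
        n = len(self.points)
        # Block-inverse update of the kernel matrix for the enlarged subset.
        new_inv = np.empty((n + 1, n + 1))
        if n:
            new_inv[:n, :n] = delta * self.K_inv + np.outer(a, a)
            new_inv[:n, n] = -a
            new_inv[n, :n] = -a
        new_inv[n, n] = 1.0
        self.K_inv = new_inv / delta
        self.points.append(np.asarray(x, dtype=float))
        return True
```

Calling `update` once per incoming sample keeps the subset small when the samples are redundant in feature space. A reduced regression problem could then be posed over one coefficient per subset element, so its size would depend on the subset rather than on the number of samples seen.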

Tobias Jung, Thomas Uthmann


ECML 2004 | Sarsa-like Reinforcement Learner | Sparse Greedy Approximation | Support Vector Regression |


Post Info

Added	01 Jul 2010
Updated	01 Jul 2010
Type	Conference
Year	2004
Where	ECML
Authors	Tobias Jung, Thomas Uthmann
