Search Sciweavers | Sciweavers

272 search results - page 19 / 55

» Parallel Reinforcement Learning with Linear Function Approxi...

click to vote

ECAI
2006
Springer

245views Artificial Intelligence» more ECAI 2006»

Least Squares SVM for Least Squares TD Learning

13 years 11 months ago

Download homepages.feis.herts.ac.uk

Abstract. We formulate the problem of least squares temporal difference learning (LSTD) in the framework of least squares SVM (LS-SVM). To cope with the large amount (and possible ...

Tobias Jung, Daniel Polani

claim paper

Read More »

click to vote

PKDD
2009
Springer

152views Data Mining» more PKDD 2009»

Feature Selection for Value Function Approximation Using Bayesian Model Selection

14 years 2 months ago

Download userweb.cs.utexas.edu

Abstract. Feature selection in reinforcement learning (RL), i.e. choosing basis functions such that useful approximations of the unkown value function can be obtained, is one of th...

Tobias Jung, Peter Stone

claim paper

Read More »

click to vote

ICPR
2006
IEEE

260views computer vision» more ICPR 2006»

Control Double Inverted Pendulum by Reinforcement Learning with Double CMAC Network

14 years 8 months ago

Download ee2.chit.edu.tw

To accelerate the learning of reinforcement learning, many types of function approximation are used to represent state value. However function approximation reduces the accuracy o...

Siwei Luo, Yu Zheng, Ziang Lv

claim paper

Read More »

click to vote

NIPS
1994

90views Information Technology» more NIPS 1994»

Reinforcement Learning with Soft State Aggregation

13 years 8 months ago

Download www.eecs.umich.edu

It is widely accepted that the use of more compact representations than lookup tables is crucial to scaling reinforcement learning (RL) algorithms to real-world problems. Unfortun...

Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...

claim paper

Read More »

click to vote

ADCM
2006

74views more ADCM 2006»

Linearly constrained reconstruction of functions by kernels with applications to machine learning

13 years 7 months ago

Download num.math.uni-goettingen.de

This paper investigates the approximation of multivariate functions from data via linear combinations of translates of a positive definite kernel from a reproducing kernel Hilbert...

Robert Schaback, J. Werner

claim paper

Read More »

« Prev « First page 19 / 55 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers