Search Sciweavers | Sciweavers

539 search results - page 64 / 108

» Learning Monotonic Linear Functions

168

click to vote

ICML
2009
IEEE

186views Machine Learning» more ICML 2009»

Regularization and feature selection in least-squares temporal difference learning

16 years 6 months ago

Download ai.stanford.edu

We consider the task of reinforcement learning with linear value function approximation. Temporal difference algorithms, and in particular the Least-Squares Temporal Difference (L...

J. Zico Kolter, Andrew Y. Ng

claim paper

Read More »

154

click to vote

JMLR
2010

103views more JMLR 2010»

Learning Nonlinear Dynamic Models from Non-sequenced Data

15 years 25 days ago

Download www.cs.cmu.edu

Virtually all methods of learning dynamic systems from data start from the same basic assumption: the learning algorithm will be given a sequence of data generated from the dynami...

Tzu-Kuo Huang, Le Song, Jeff Schneider

claim paper

Read More »

155

click to vote

CVPR
2007
IEEE

213views Computer Vision» more CVPR 2007»

Discriminative Learning of Dynamical Systems for Motion Tracking

16 years 8 months ago

Download www.cs.rutgers.edu

We introduce novel discriminative learning algorithms for dynamical systems. Models such as Conditional Random Fields or Maximum Entropy Markov Models outperform the generative Hi...

Minyoung Kim, Vladimir Pavlovic

claim paper

Read More »

168

click to vote

ECAI
2006
Springer

245views Artificial Intelligence» more ECAI 2006»

Least Squares SVM for Least Squares TD Learning

15 years 9 months ago

Download homepages.feis.herts.ac.uk

Abstract. We formulate the problem of least squares temporal difference learning (LSTD) in the framework of least squares SVM (LS-SVM). To cope with the large amount (and possible ...

Tobias Jung, Daniel Polani

claim paper

Read More »

143

click to vote

GECCO
2006
Springer

159views Optimization» more GECCO 2006»

Standard and averaging reinforcement learning in XCS

15 years 9 months ago

Download www.cs.bham.ac.uk

This paper investigates reinforcement learning (RL) in XCS. First, it formally shows that XCS implements a method of generalized RL based on linear approximators, in which the usu...

Pier Luca Lanzi, Daniele Loiacono

claim paper

Read More »

« Prev « First page 64 / 108 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers