Search Sciweavers | Sciweavers

272 search results - page 29 / 55

» Parallel Reinforcement Learning with Linear Function Approxi...

ICML
2009
IEEE

120 views, Machine Learning

Learning linear dynamical systems without sequence information

16 years 1 month ago

Download www.cs.mcgill.ca

Virtually all methods of learning dynamic systems from data start from the same basic assumption: that the learning algorithm will be provided with a sequence, or trajectory, of d...

Tzu-Kuo Huang, Jeff Schneider

Publication

222 views

Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration

16 years 3 months ago

Download arxiv.org

Abstract: Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros

ML
2002
ACM

154 views, Machine Learning

Technical Update: Least-Squares Temporal Difference Learning

15 years 6 months ago

Download www.research.rutgers.edu

TD(λ) is a popular family of algorithms for approximate policy evaluation in large MDPs. TD(λ) works by incrementally updating the value function after each observed transition. It h...

Justin A. Boyan

IJCNN
2006
IEEE

117 views, Neural Networks

Learning to Rank by Maximizing AUC with Linear Programming

16 years 26 days ago

Download dollar.biz.uiowa.edu

Area Under the ROC Curve (AUC) is often used to evaluate ranking performance in binary classification problems. Several researchers have approached AUC optimization by approxi...

Kaan Ataman, W. Nick Street, Yi Zhang

GECCO
2009
Springer

200 views, Optimization

Apply ant colony optimization to Tetris

16 years 1 month ago

Download cs.nju.edu.cn

Tetris is a falling block game where the player’s objective is to arrange a sequence of different shaped tetrominoes smoothly in order to survive. In the intelligence games, ag...

Xingguo Chen, Hao Wang, Weiwei Wang, Yinghuan Shi,...
