Search Sciweavers | Sciweavers

272 search results - page 35 / 55

» Parallel Reinforcement Learning with Linear Function Approxi...

176

click to vote

GECCO
2006
Springer

196views Optimization» more GECCO 2006»

An anticipatory approach to improve XCSF

15 years 10 months ago

Download www.cs.bham.ac.uk

XCSF is a novel version of learning classifier systems (LCS) which extends the typical concept of LCS by introducing computable classifier prediction. In XCSF Classifier predictio...

Amin Nikanjam, Adel Torkaman Rahmani

claim paper

Read More »

203

click to vote

NIPS
2001

206views Information Technology» more NIPS 2001»

Model-Free Least-Squares Policy Iteration

15 years 8 months ago

Download www.cs.duke.edu

We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...

Michail G. Lagoudakis, Ronald Parr

claim paper

Read More »

199

click to vote

ML
2002
ACM

168views Machine Learning» more ML 2002»

On Average Versus Discounted Reward Temporal-Difference Learning

15 years 6 months ago

Download web.mit.edu

We provide an analytical comparison between discounted and average reward temporal-difference (TD) learning with linearly parameterized approximations. We first consider the asympt...

John N. Tsitsiklis, Benjamin Van Roy

claim paper

Read More »

208

click to vote

IPPS
2009
IEEE

110views Distributed And Parallel Com...» more IPPS 2009»

Multi-users scheduling in parallel systems

16 years 1 months ago

Download moais.imag.fr

We are interested in this paper to study scheduling problems in systems where many users compete to perform their respective jobs on shared parallel resources. Each user has speci...

Erik Saule, Denis Trystram

claim paper

Read More »

209

click to vote

PKDD
2010
Springer

169views Data Mining» more PKDD 2010»

Classification with Sums of Separable Functions

15 years 4 months ago

Download www.math.tu-berlin.de

Abstract. We present a novel approach for classification using a discretised function representation which is independent of the data locations. We construct the classifier as a su...

Jochen Garcke

claim paper

Read More »

« Prev « First page 35 / 55 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers