Search Sciweavers | Sciweavers

272 search results - page 4 / 55

» Parallel Reinforcement Learning with Linear Function Approxi...

197

Voted

ICML
2010
IEEE

231views Machine Learning» more ICML 2010»

Toward Off-Policy Learning Control with Function Approximation

15 years 7 months ago

Download www.sztaki.hu

We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...

Hamid Reza Maei, Csaba Szepesvári, Shalabh ...

claim paper

Read More »

159

click to vote

GECCO
2006
Springer

177views Optimization» more GECCO 2006»

Hyper-ellipsoidal conditions in XCS: rotation, linear approximation, and solution structure

15 years 10 months ago

Download www.eskimo.com

The learning classifier system XCS is an iterative rulelearning system that evolves rule structures based on gradient-based prediction and rule quality estimates. Besides classifi...

Martin V. Butz, Pier Luca Lanzi, Stewart W. Wilson

claim paper

Read More »

138

Voted

ICML
2008
IEEE

133views Machine Learning» more ICML 2008»

An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning

16 years 7 months ago

Download www.cs.duke.edu

We show that linear value-function approximation is equivalent to a form of linear model approximation. We then derive a relationship between the model-approximation error and the...

Ronald Parr, Lihong Li, Gavin Taylor, Christopher ...

claim paper

Read More »

188

click to vote

CDC
2010
IEEE

160views Control Systems» more CDC 2010»

Adaptive bases for Q-learning

15 years 1 months ago

Download webee.technion.ac.il

Abstract-- We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...

Dotan Di Castro, Shie Mannor

claim paper

Read More »

171

Voted

ICML
2009
IEEE

185views Machine Learning» more ICML 2009»

Kernelized value function approximation for reinforcement learning

16 years 7 months ago

Sciweavers

Explore & Download

Productivity Tools

Sciweavers