Search Sciweavers | Sciweavers

61 search results - page 1 / 13

» Convergence of synchronous reinforcement learning with linea...

click to vote

ICML
2004
IEEE

145views Machine Learning» more ICML 2004»

Convergence of synchronous reinforcement learning with linear function approximation

15 years 5 months ago

Download www.machinelearning.org

Synchronous reinforcement learning (RL) algorithms with linear function approximation are representable as inhomogeneous matrix iterations of a special form (Schoknecht & Merk...

Artur Merke, Ralf Schoknecht

claim paper

Read More »

click to vote

NIPS
2001

121views Information Technology» more NIPS 2001»

Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning

14 years 6 months ago

Download books.nips.cc

We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...

Gregory Z. Grudic, Lyle H. Ungar

claim paper

Read More »

click to vote

ICML
2007
IEEE

180views Machine Learning» more ICML 2007»

Tracking value function dynamics to improve reinforcement learning with piecewise linear function approximation

15 years 5 months ago

Download www.machinelearning.org

Reinforcement learning algorithms can become unstable when combined with linear function approximation. Algorithms that minimize the mean-square Bellman error are guaranteed to co...

Chee Wee Phua, Robert Fitch

claim paper

Read More »

click to vote

ICML
2003
IEEE

146views Machine Learning» more ICML 2003»

TD(0) Converges Provably Faster than the Residual Gradient Algorithm

15 years 5 months ago

Download www.hpl.hp.com

In Reinforcement Learning (RL) there has been some experimental evidence that the residual gradient algorithm converges slower than the TD(0) algorithm. In this paper, we use the ...

Ralf Schoknecht, Artur Merke

claim paper

Read More »

click to vote

AAMAS
2007
Springer

142views Intelligent Agents» more AAMAS 2007»

Parallel Reinforcement Learning with Linear Function Approximation

14 years 5 months ago

Download www.aamas-conference.org

In this paper, we investigate the use of parallelization in reinforcement learning (RL), with the goal of learning optimal policies for single-agent RL problems more quickly by us...

Matthew Grounds, Daniel Kudenko

claim paper

Read More »

« Prev « First page 1 / 13 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers