Search Sciweavers | Sciweavers

536 search results - page 7 / 108

» Residual Algorithms: Reinforcement Learning with Function Ap...


NIPS
2001


Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning


We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...

Gregory Z. Grudic, Lyle H. Ungar


NN
2010
Springer


Efficient exploration through active learning for value function approximation in reinforcement learning


Appropriately designing sampling policies is highly important for obtaining better control policies in reinforcement learning. In this paper, we first show that the least-squares ...

Takayuki Akiyama, Hirotaka Hachiya, Masashi Sugiya...


ICML
2010
IEEE


Toward Off-Policy Learning Control with Function Approximation


We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...

Hamid Reza Maei, Csaba Szepesvári, Shalabh ...


ESANN
2006


Reducing policy degradation in neuro-dynamic programming


We focus on neuro-dynamic programming methods to learn state-action value functions and outline some of the inherent problems to be faced, when performing reinforcement learning in...

Thomas Gabel, Martin Riedmiller


NIPS
2007


Incremental Natural Actor-Critic Algorithms


We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...
