Sciweavers

437 search results - page 9 / 88

» Policy Gradient Critics

121

AIPS
2007

76views Artificial Intelligence» more AIPS 2007»

FF + FPG: Guiding a Policy-Gradient Planner

15 years 9 months ago

FF + FPG: Guiding a Policy-Gradient Planner

Download www.aaai.org

Olivier Buffet, Douglas Aberdeen

claim paper

Read More »

155

AIPS
2007

81views Artificial Intelligence» more AIPS 2007»

Gradient-Based Relational Reinforcement Learning of Temporally Extended Policies

15 years 9 months ago

Gradient-Based Relational Reinforcement Learning of Temporally Extended Policies

Download www.cs.umd.edu

Charles Gretton

claim paper

Read More »

131

Voted

IGPL
2010

83views more IGPL 2010»

Recurrent policy gradients

15 years 5 months ago

Recurrent policy gradients

Download www.idsia.ch

Daan Wierstra, Alexander Förster, Jan Peters,...

claim paper

Read More »

144

RAS
2010

220views more RAS 2010»

Policy gradient learning for quadruped soccer robots

15 years 1 months ago

Policy gradient learning for quadruped soccer robots

Download www.irisa.fr

Andrea Cherubini, Francesca Giannone, Luca Iocchi,...

claim paper

Read More »

165

NIPS
2001

121views Information Technology» more NIPS 2001»

Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning

15 years 8 months ago

Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning

Download books.nips.cc

We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...

Gregory Z. Grudic, Lyle H. Ungar

claim paper

Read More »

« Prev « First page 9 / 88 Last » Next »