Sciweavers

437 search results - page 9 / 88
» Policy Gradient Critics
Sort
View
AIPS
2007
13 years 10 months ago
FF + FPG: Guiding a Policy-Gradient Planner
Olivier Buffet, Douglas Aberdeen
IGPL
2010
83views more  IGPL 2010»
13 years 6 months ago
Recurrent policy gradients
Daan Wierstra, Alexander Förster, Jan Peters,...
RAS
2010
220views more  RAS 2010»
13 years 2 months ago
Policy gradient learning for quadruped soccer robots
Andrea Cherubini, Francesca Giannone, Luca Iocchi,...
NIPS
2001
13 years 9 months ago
Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning
We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...
Gregory Z. Grudic, Lyle H. Ungar