Sciweavers

95 search results - page 7 / 19

» Policy Gradients for Cryptanalysis

159

AIPS
2007

81views Artificial Intelligence» more AIPS 2007»

Gradient-Based Relational Reinforcement Learning of Temporally Extended Policies

15 years 9 months ago

Gradient-Based Relational Reinforcement Learning of Temporally Extended Policies

Download www.cs.umd.edu

Charles Gretton

claim paper

Read More »

134

IGPL
2010

83views more IGPL 2010»

Recurrent policy gradients

15 years 5 months ago

Recurrent policy gradients

Download www.idsia.ch

Daan Wierstra, Alexander Förster, Jan Peters,...

claim paper

Read More »

147

RAS
2010

220views more RAS 2010»

Policy gradient learning for quadruped soccer robots

15 years 1 months ago

Policy gradient learning for quadruped soccer robots

Download www.irisa.fr

Andrea Cherubini, Francesca Giannone, Luca Iocchi,...

claim paper

Read More »

168

NIPS
2001

121views Information Technology» more NIPS 2001»

Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning

15 years 8 months ago

Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning

Download books.nips.cc

We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...

Gregory Z. Grudic, Lyle H. Ungar

claim paper

Read More »

172

AAAI
2010

171views Intelligent Agents» more AAAI 2010»

Multi-Agent Learning with Policy Prediction

15 years 8 months ago

Multi-Agent Learning with Policy Prediction

Download www.cs.umass.edu

Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. This paper first introduces a new gradient-based learning algorithm, augmenting th...

Chongjie Zhang, Victor R. Lesser

claim paper

Read More »

« Prev « First page 7 / 19 Last » Next »