Sciweavers

95 search results - page 7 / 19
» Policy Gradients for Cryptanalysis
Sort
View
IGPL
2010
83views more  IGPL 2010»
13 years 6 months ago
Recurrent policy gradients
Daan Wierstra, Alexander Förster, Jan Peters,...
RAS
2010
220views more  RAS 2010»
13 years 2 months ago
Policy gradient learning for quadruped soccer robots
Andrea Cherubini, Francesca Giannone, Luca Iocchi,...
NIPS
2001
13 years 9 months ago
Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning
We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...
Gregory Z. Grudic, Lyle H. Ungar
AAAI
2010
13 years 9 months ago
Multi-Agent Learning with Policy Prediction
Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. This paper first introduces a new gradient-based learning algorithm, augmenting th...
Chongjie Zhang, Victor R. Lesser