Search Sciweavers | Sciweavers

91 search results - page 4 / 19

» Parameter-exploring policy gradients

191

click to vote

CEC
2011
IEEE

221views Artificial Intelligence» more CEC 2011»

Stochastic Natural Gradient Descent by estimation of empirical covariances

14 years 5 months ago

Download chrome.ws.dei.polimi.it

—Stochastic relaxation aims at ﬁnding the minimum of a ﬁtness function by identifying a proper sequence of distributions, in a given model, that minimize the expected value o...

Luigi Malagò, Matteo Matteucci, Giovanni Pi...

claim paper

Read More »

171

click to vote

NIPS
2008

110views Information Technology» more NIPS 2008»

Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

15 years 7 months ago

Download groups.csail.mit.edu

Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...

John W. Roberts, Russ Tedrake

claim paper

Read More »

160

click to vote

IJCAI
2003

169views Artificial Intelligence» more IJCAI 2003»

Covariant Policy Search

15 years 7 months ago

Download www.ri.cmu.edu

We investigate the problem of non-covariant behavior of policy gradient reinforcement learning algorithms. The policy gradient approach is amenable to analysis by information geom...

J. Andrew Bagnell, Jeff G. Schneider

claim paper

Read More »

127

click to vote

AIPS
2007

119views Artificial Intelligence» more AIPS 2007»

Concurrent Probabilistic Temporal Planning with Policy-Gradients

15 years 7 months ago

Download eprints.pascal-network.org

We present an any-time concurrent probabilistic temporal planner that includes continuous and discrete uncertainties and metric functions. Our approach is a direct policy search t...

Douglas Aberdeen, Olivier Buffet

claim paper

Read More »

155

click to vote

AAAI
2010

191views Intelligent Agents» more AAAI 2010»

Relative Entropy Policy Search

15 years 7 months ago

Download www.kyb.tuebingen.mpg.de

Policy search is a successful approach to reinforcement learning. However, policy improvements often result in the loss of information. Hence, it has been marred by premature conv...

Jan Peters, Katharina Mülling, Yasemin Altun

claim paper

Read More »

« Prev « First page 4 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers