Search Sciweavers | Sciweavers

14 search results - page 1 / 3

» Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

196

click to vote

NIPS
2008

110views Information Technology» more NIPS 2008»

Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

15 years 8 months ago

Download groups.csail.mit.edu

Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...

John W. Roberts, Russ Tedrake

claim paper

Read More »

176

click to vote

IJCAI
2003

169views Artificial Intelligence» more IJCAI 2003»

Covariant Policy Search

15 years 8 months ago

Download www.ri.cmu.edu

We investigate the problem of non-covariant behavior of policy gradient reinforcement learning algorithms. The policy gradient approach is amenable to analysis by information geom...

J. Andrew Bagnell, Jeff G. Schneider

claim paper

Read More »

178

click to vote

ORL
2007

112views more ORL 2007»

Competitive analysis of a dispatch policy for a dynamic multi-period routing problem

15 years 6 months ago

Download www2.isye.gatech.edu

We analyze a simple and natural on-line algorithm (dispatch policy) for a dynamic multiperiod uncapacitated routing problem, in which at the beginning of each time period a set of...

Enrico Angelelli, Martin W. P. Savelsbergh, Maria ...

claim paper

Read More »

172

Voted

ICANNGA
2007
Springer

105views Algorithms» more ICANNGA 2007»

Reinforcement Learning in Fine Time Discretization

16 years 25 days ago

Download staff.elka.pw.edu.pl

Reinforcement Learning (RL) is analyzed here as a tool for control system optimization. State and action spaces are assumed to be continuous. Time is assumed to be discrete, yet th...

Pawel Wawrzynski

claim paper

Read More »

189

Voted

COMPUTING
2004

204views more COMPUTING 2004»

Image Registration by a Regularized Gradient Flow. A Streaming Implementation in DX9 Graphics Hardware

15 years 6 months ago

Download www.mpi-inf.mpg.de

The presented image registration method uses a regularized gradient flow to correlate the intensities in two images. Thereby, an energy functional is successively minimized by des...

Robert Strzodka, Marc Droske, Martin Rumpf

claim paper

Read More »

« Prev « First page 1 / 3 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers