Search Sciweavers | Sciweavers

195

NIPS
2008

110views Information Technology» more NIPS 2008»

Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

15 years 8 months ago

Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...

John W. Roberts, Russ Tedrake

claim paper

Read More »

161

click to vote

ANOR
2004

155views more ANOR 2004»

Model-Based Search for Combinatorial Optimization: A Critical Survey

15 years 6 months ago

Download code.ulb.ac.be

In this paper we introduce model-based search as a unifying framework accommodating some recently proposed metaheuristics for combinatorial optimization such as ant colony optimiza...

Mark Zlochin, Mauro Birattari, Nicolas Meuleau, Ma...

claim paper

Read More »

176

click to vote

IJCAI
2003

169views Artificial Intelligence» more IJCAI 2003»

Covariant Policy Search

15 years 8 months ago

Download www.ri.cmu.edu

We investigate the problem of non-covariant behavior of policy gradient reinforcement learning algorithms. The policy gradient approach is amenable to analysis by information geom...

J. Andrew Bagnell, Jeff G. Schneider

claim paper

Read More »

148

click to vote

AIPS
2007

119views Artificial Intelligence» more AIPS 2007»

Concurrent Probabilistic Temporal Planning with Policy-Gradients

15 years 9 months ago

Download eprints.pascal-network.org

We present an any-time concurrent probabilistic temporal planner that includes continuous and discrete uncertainties and metric functions. Our approach is a direct policy search t...

Douglas Aberdeen, Olivier Buffet

claim paper

Read More »

192

click to vote

AAAI
2011

145views Intelligent Agents» more AAAI 2011»

Policy Gradient Planning for Environmental Decision Making with Existing Simulators

14 years 6 months ago

Download www.cs.ubc.ca

In environmental and natural resource planning domains actions are taken at a large number of locations over multiple time periods. These problems have enormous state and action s...

Mark Crowley, David Poole

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers