Search Sciweavers | Sciweavers

115 search results - page 3 / 23

» Recurrent policy gradients

UAI
2008

234 views, Artificial Intelligence

Improving Gradient Estimation by Incorporating Sensor Data

15 years 8 months ago

Download www.cs.berkeley.edu

An efficient policy search algorithm should estimate the local gradient of the objective function, with respect to the policy parameters, from as few trials as possible. Whereas m...

Gregory Lawrence, Stuart J. Russell

SIAMCO
2008

112 views

A Knowledge-Gradient Policy for Sequential Information Collection

15 years 7 months ago

Download www.castlelab.princeton.edu

In a sequential Bayesian ranking and selection problem with independent normal populations and common known variance, we study a previously introduced measurement policy which we ...

Peter Frazier, Warren B. Powell, Savas Dayanik

ICML
2009
IEEE

148 views, Machine Learning

Predictive representations for policy gradient in POMDPs

16 years 8 months ago

Download damas.ift.ulaval.ca

We consider the problem of estimating the policy gradient in Partially Observable Markov Decision Processes (POMDPs) with a special class of policies that are based on Predictive ...

Abdeslam Boularias, Brahim Chaib-draa

ECML
2005
Springer

193 views, Machine Learning

Natural Actor-Critic

16 years 28 days ago

Download www-clmc.usc.edu

This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...

Jan Peters, Sethu Vijayakumar, Stefan Schaal

ICML
2003
IEEE

117 views, Machine Learning

Model-based Policy Gradient Reinforcement Learning

16 years 8 months ago
