An efficient policy search algorithm should estimate the local gradient of the objective function, with respect to the policy parameters, from as few trials as possible. Whereas most policy search methods estimate this gradient by observing the rewards obtained during policy trials, we show, both theoretically and empirically, that taking into account the sensor data as well gives better gradient estimates and hence faster learning. The reason is that rewards obtained during policy execution vary from trial to trial due to noise in the environment; sensor data, which correlates with the noise, can be used to partially correct for this variation, resulting in an estimator with lower variance.
Gregory Lawrence, Stuart J. Russell
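
The mechanism described above resembles a control-variate correction to a score-function (REINFORCE-style) gradient estimate. The sketch below is a minimal illustration of that idea, not the authors' algorithm: a toy one-parameter Gaussian policy, a reward perturbed by environment noise, and a sensor reading that correlates with that noise. All names, distributions, and constants are illustrative assumptions.

```python
# Minimal sketch (assumed setup, not the paper's method): reduce the variance
# of a score-function gradient estimate by subtracting a sensor reading that
# correlates with the environment noise in the reward.
import numpy as np

rng = np.random.default_rng(0)

theta = 0.5          # policy parameter (scalar, for illustration)
policy_std = 1.0     # exploration noise of the Gaussian policy
noise_std = 2.0      # environment noise that perturbs the reward
n_trials = 10_000

def run_trial(theta):
    """One policy trial: action, noisy reward, and a sensor reading
    correlated with the reward noise but independent of the action."""
    action = rng.normal(theta, policy_std)
    env_noise = rng.normal(0.0, noise_std)
    reward = -(action - 1.0) ** 2 + env_noise      # true objective plus noise
    sensor = env_noise + rng.normal(0.0, 0.5)      # noisy measurement of the noise
    score = (action - theta) / policy_std ** 2     # d/dtheta log pi(action | theta)
    return score, reward, sensor

scores, rewards, sensors = map(
    np.array, zip(*(run_trial(theta) for _ in range(n_trials)))
)

# Plain score-function (REINFORCE) gradient samples.
g_plain = scores * rewards

# Sensor-corrected samples: subtract c * sensor, with c fit by least squares.
# Because the sensor is independent of the action, E[score * c * sensor] = 0,
# so the corrected estimator remains (asymptotically) unbiased while its
# variance drops in proportion to how well the sensor explains the noise.
c = np.cov(rewards, sensors)[0, 1] / np.var(sensors)
g_corrected = scores * (rewards - c * sensors)

print("plain     mean:", g_plain.mean(), "  variance:", g_plain.var())
print("corrected mean:", g_corrected.mean(), "  variance:", g_corrected.var())
```

Running this sketch, both estimators agree on the gradient's mean, but the sensor-corrected samples show substantially lower variance, which is the effect the abstract attributes to incorporating sensor data.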