Search Sciweavers | Sciweavers

510 search results - page 5 / 102

» Gradient Estimation Revitalized

CORR
2011
Springer

Fast global convergence of gradient methods for high-dimensional statistical recovery

15 years 1 months ago

Download www.cs.berkeley.edu

Many statistical M-estimators are based on convex optimization problems formed by the weighted sum of a loss function with a norm-based regularizer. We analyze the convergence rat...

Alekh Agarwal, Sahand Negahban, Martin J. Wainwrig...

ICIP
2009
IEEE

Single image defocus map estimation using local contrast prior

15 years 4 months ago

Download yuwing.kaist.ac.kr

Image defocus estimation is useful for several applications including deblurring, blur magnification, measuring image quality, and depth of field segmentation. In this paper, we p...

Yu-Wing Tai, Michael S. Brown

CDC
2010
IEEE

Estimation of general nonlinear state-space systems

15 years 1 months ago

Download www.control.isy.liu.se

This paper presents a novel approach to the estimation of a general class of dynamic nonlinear system models. The main contribution is the use of a tool from mathematical statistic...

Brett Ninness, Adrian Wills, Thomas B. Schön

COLT
2000
Springer

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 11 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process (POMDP), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

NIPS
2001

Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning

15 years 8 months ago

Download books.nips.cc

We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...

Gregory Z. Grudic, Lyle H. Ungar
