Search Sciweavers | Sciweavers

510 search results - page 6 / 102

» Gradient Estimation Revitalized

180

click to vote

CORR
2004
Springer

103views Education» more CORR 2004»

Online convex optimization in the bandit setting: gradient descent without a gradient

15 years 6 months ago

Download www.cs.cmu.edu

We study a general online convex optimization problem. We have a convex set S and an unknown sequence of cost functions c1, c2, . . . , and in each period, we choose a feasible po...

Abraham Flaxman, Adam Tauman Kalai, H. Brendan McM...

claim paper

Read More »

197

click to vote

CORR
2006
Springer

113views Education» more CORR 2006»

A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD

15 years 6 months ago

Download hal.inria.fr

This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(), LSTD()...

Manuel Loth, Philippe Preux

claim paper

Read More »

176

click to vote

COLT
2008
Springer

143views Machine Learning» more COLT 2008»

Learning Coordinate Gradients with Multi-Task Kernels

15 years 8 months ago

Download colt2008.cs.helsinki.fi

Coordinate gradient learning is motivated by the problem of variable selection and determining variable covariation. In this paper we propose a novel unifying framework for coordi...

Yiming Ying, Colin Campbell

claim paper

Read More »

167

click to vote

ICASSP
2011
IEEE

131views Signal Processing» more ICASSP 2011»

Direction-of-arrival estimation using acoustic vector sensors in the presence of noise

14 years 10 months ago

Download mirlab.org

A vector-sensor consisting of a monopole sensor collocated with orthogonally oriented dipole sensors can be used for direction-ofarrival (DOA) estimation. A method is proposed to ...

Dovid Levin, Sharon Gannot, Emanuel A. P. Habets

claim paper

Read More »

188

click to vote

IOR
2011

107views more IOR 2011»

Information Collection on a Graph

15 years 1 months ago

Download www.castlelab.princeton.edu

We derive a knowledge gradient policy for an optimal learning problem on a graph, in which we use sequential measurements to reﬁne Bayesian estimates of individual edge values i...

Ilya O. Ryzhov, Warren B. Powell

claim paper

Read More »

« Prev « First page 6 / 102 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers