Search Sciweavers | Sciweavers

97 search results - page 9 / 20

» Logarithmic Regret Algorithms for Online Convex Optimization

click to vote

ECCC
2007

180views more ECCC 2007»

Adaptive Algorithms for Online Decision Problems

13 years 7 months ago

Download ftp.cs.princeton.edu

We study the notion of learning in an oblivious changing environment. Existing online learning algorithms which minimize regret are shown to converge to the average of all locally...

Elad Hazan, C. Seshadhri

claim paper

Read More »

click to vote

COLT
2008
Springer

140views Machine Learning» more COLT 2008»

Regret Bounds for Sleeping Experts and Bandits

13 years 9 months ago

Download colt2008.cs.helsinki.fi

We study on-line decision problems where the set of actions that are available to the decision algorithm vary over time. With a few notable exceptions, such problems remained larg...

Robert D. Kleinberg, Alexandru Niculescu-Mizil, Yo...

claim paper

Read More »

click to vote

NIPS
2007

135views Information Technology» more NIPS 2007»

The Price of Bandit Information for Online Optimization

13 years 9 months ago

Download books.nips.cc

In the online linear optimization problem, a learner must choose, in each round, a decision from a set D ⊂ Rn in order to minimize an (unknown and changing) linear cost function...

Varsha Dani, Thomas P. Hayes, Sham Kakade

claim paper

Read More »

click to vote

CORR
2011
Springer

202views Education» more CORR 2011»

Online Least Squares Estimation with Self-Normalized Processes: An Application to Bandit Problems

13 years 2 months ago

Download www.ualberta.ca

The analysis of online least squares estimation is at the heart of many stochastic sequential decision-making problems. We employ tools from the self-normalized processes to provi...

Yasin Abbasi-Yadkori, Dávid Pál, Csa...

claim paper

Read More »

click to vote

JMLR
2010

161views more JMLR 2010»

Dual Averaging Methods for Regularized Stochastic Learning and Online Optimization

13 years 2 months ago

Download jmlr.csail.mit.edu

We consider regularized stochastic learning and online optimization problems, where the objective function is the sum of two convex terms: one is the loss function of the learning...

Lin Xiao

claim paper

Read More »

« Prev « First page 9 / 20 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers