Search Sciweavers | Sciweavers

13 search results - page 1 / 3

» Beyond Logarithmic Bounds in Online Learning

JMLR
2012

239views Programming Languages» more JMLR 2012»

Beyond Logarithmic Bounds in Online Learning

13 years 8 months ago

Download francesco.orabona.com

We prove logarithmic regret bounds that depend on the loss L*_T of the competitor rather than on the number T of time steps. In the general online convex optimization setting, o...

Francesco Orabona, Nicolò Cesa-Bianchi, Cla...

ALT
2008
Springer

141views Machine Learning» more ALT 2008»

Online Regret Bounds for Markov Decision Processes with Deterministic Transitions

16 years 3 months ago

Download personal.unileoben.ac.at

Abstract. We consider an upper confidence bound algorithm for Markov decision processes (MDPs) with deterministic transitions. For this algorithm we derive upper bounds on the onl...

Ronald Ortner

COLT
2006
Springer

179views Machine Learning» more COLT 2006»

Logarithmic Regret Algorithms for Online Convex Optimization

15 years 10 months ago

Download www.cs.princeton.edu

In an online convex optimization problem a decision-maker makes a sequence of decisions, i.e., chooses a sequence of points in Euclidean space, from a fixed feasible set. After ea...

Elad Hazan, Adam Kalai, Satyen Kale, Amit Agarwal

CORR
2011
Springer

210views Education» more CORR 2011»

Online Learning of Rested and Restless Bandits

15 years 1 month ago

Download www.eecs.umich.edu

In this paper we study the online learning problem involving rested and restless multiarmed bandits with multiple plays. The system consists of a single player/user and a set of K...

Cem Tekin, Mingyan Liu

ICML
2007
IEEE

166views Machine Learning» more ICML 2007»

Online kernel PCA with entropic matrix updates

16 years 7 months ago

Download www.machinelearning.org

A number of updates for density matrices have been developed recently that are motivated by relative entropy minimization problems. The updates involve a softmin calculation based...

Dima Kuzmin, Manfred K. Warmuth
