COLT 2010 | Sciweavers

194

COLT
2010
Springer

129views Machine Learning» more COLT 2010»

15 years 4 months ago

We consider a bandit problem which involves sequential sampling from two populations (arms). Each arm produces a noisy reward realization which depends on an observable random cov...

Philippe Rigollet, Assaf Zeevi

claim paper

Read More »

178

click to vote

COLT
2010
Springer

205views Machine Learning» more COLT 2010»

Convex Games in Banach Spaces

15 years 4 months ago

Download www.cs.utexas.edu

We study the regret of an online learner playing a multi-round game in a Banach space B against an adversary that plays a convex function at each round. We characterize the minima...

Karthik Sridharan, Ambuj Tewari

claim paper

Read More »

155

click to vote

COLT
2010
Springer

136views Machine Learning» more COLT 2010»

Improved Guarantees for Agnostic Learning of Disjunctions

15 years 4 months ago

Download www.cs.cmu.edu

Pranjal Awasthi, Avrim Blum, Or Sheffet

claim paper

Read More »

171

click to vote

COLT
2010
Springer

122views Machine Learning» more COLT 2010»

Inferring Descriptive Generalisations of Formal Languages

15 years 4 months ago

Download www.colt2010.org

In the present paper, we introduce a variant of Gold-style learners that is not required to infer precise descriptions of the languages in a class, but that must find descriptive ...

Dominik D. Freydenberger, Daniel Reidenbach

claim paper

Read More »

187

click to vote

COLT
2010
Springer

186views Machine Learning» more COLT 2010»

Following the Flattened Leader

15 years 4 months ago

Download www.colt2010.org

We analyze the regret, measured in terms of log loss, of the maximum likelihood (ML) sequential prediction strategy. This "follow the leader" strategy also defines one o...

Wojciech Kotlowski, Peter Grünwald, Steven de...

claim paper

Read More »

185

click to vote

COLT
2010
Springer

207views Machine Learning» more COLT 2010»

An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

15 years 4 months ago

Download www.colt2010.org

Multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...

Junya Honda, Akimichi Takemura

claim paper

Read More »

284

click to vote

COLT
2010
Springer

238views Machine Learning» more COLT 2010»

Adaptive Subgradient Methods for Online Learning and Stochastic Optimization

15 years 4 months ago

Download www.colt2010.org

We present a new family of subgradient methods that dynamically incorporate knowledge of the geometry of the data observed in earlier iterations to perform more informative gradie...

John Duchi, Elad Hazan, Yoram Singer

claim paper

Read More »

169

click to vote

COLT
2010
Springer

150views Machine Learning» more COLT 2010»

Robust Hierarchical Clustering

15 years 4 months ago

Download www.colt2010.org

One of the most widely used techniques for data clustering is agglomerative clustering. Such algorithms have been long used across many different fields ranging from computational...

Maria-Florina Balcan, Pramod Gupta

claim paper

Read More »

196

click to vote

COLT
2010
Springer

166views Machine Learning» more COLT 2010»

Deterministic Sparse Fourier Approximation via Fooling Arithmetic Progressions

15 years 4 months ago

Download www.colt2010.org

A significant Fourier transform (SFT) algorithm, given a threshold and oracle access to a function f, outputs (the frequencies and approximate values of) all the -significant Fou...

Adi Akavia

claim paper

Read More »

191

click to vote

COLT
2010
Springer

181views Machine Learning» more COLT 2010»

Composite Objective Mirror Descent

15 years 4 months ago

Download www.cs.berkeley.edu

We present a new method for regularized convex optimization and analyze it under both online and stochastic optimization settings. In addition to unifying previously known firstor...

John Duchi, Shai Shalev-Shwartz, Yoram Singer, Amb...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers