Extracting Certainty from Uncertainty: Regret Bounded by Variation in Costs

Download colt2008.cs.helsinki.fi

Prediction from expert advice is a fundamental problem in machine learning. A major pillar of the field is the existence of learning algorithms whose average loss approaches that of the best expert in hindsight (in other words, whose average regret approaches zero). Traditionally, the regret of online algorithms has been bounded in terms of the number of prediction rounds. Cesa-Bianchi, Mansour and Stoltz [4] posed the question of whether it is possible to bound the regret of an online algorithm by the variation of the observed costs. In this paper we resolve this question and prove such bounds in the fully adversarial setting, in two important online learning scenarios: prediction from expert advice, and online linear optimization.
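To make the quantities in the abstract concrete, here is a minimal Python sketch of the expert-advice setting. It is not the algorithm from the paper: it runs the classical Hedge (multiplicative weights) forecaster, whose regret is bounded in terms of the number of rounds T, and reports a variation quantity alongside it. The function names, the learning rate, and the definition used in `total_variation` (sum of squared distances of each cost vector from the mean cost vector) are assumptions made for illustration; the paper's precise definition of variation may differ.

```python
import math
import random


def hedge(costs, eta):
    """Classical Hedge forecaster (illustrative, not the paper's method).

    costs: list of rounds, each a list of per-expert costs in [0, 1].
    eta:   learning rate (> 0).
    Returns the forecaster's total expected cost over all rounds.
    """
    n = len(costs[0])
    weights = [1.0] * n
    total = 0.0
    for round_costs in costs:
        z = sum(weights)
        probs = [w / z for w in weights]
        # Expected cost of following a randomly drawn expert this round.
        total += sum(p * c for p, c in zip(probs, round_costs))
        # Exponentially down-weight experts that incurred high cost.
        weights = [w * math.exp(-eta * c) for w, c in zip(weights, round_costs)]
    return total


def best_expert_cost(costs):
    """Total cost of the single best expert in hindsight."""
    n = len(costs[0])
    return min(sum(r[i] for r in costs) for i in range(n))


def regret(costs, eta):
    """Forecaster's total cost minus the best expert's total cost."""
    return hedge(costs, eta) - best_expert_cost(costs)


def total_variation(costs):
    """Assumed notion of variation: sum over rounds of the squared
    Euclidean distance between the cost vector and the mean cost vector."""
    T, n = len(costs), len(costs[0])
    mean = [sum(r[i] for r in costs) / T for i in range(n)]
    return sum(sum((r[i] - mean[i]) ** 2 for i in range(n)) for r in costs)


if __name__ == "__main__":
    rng = random.Random(0)
    T, n = 1000, 5
    costs = [[rng.random() for _ in range(n)] for _ in range(T)]
    eta = math.sqrt(8 * math.log(n) / T)  # standard tuning for costs in [0, 1]
    print("regret:", regret(costs, eta))
    print("round-based bound:", math.sqrt(T * math.log(n) / 2))
    print("variation (assumed definition):", total_variation(costs))
```

With this tuning, the regret of Hedge is at most sqrt(T ln(n) / 2) regardless of how much the costs actually fluctuate. When the cost vectors barely change from round to round, the variation is far smaller than T, which is the regime where a variation-based bound of the kind the abstract describes would be much tighter.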

Elad Hazan, Satyen Kale

COLT 2008 | Expert Advice | Machine Learning | Online Algorithm | Regret Approaches Zero

Post Info

Added	18 Oct 2010
Updated	18 Oct 2010
Type	Conference
Year	2008
Where	COLT
Authors	Elad Hazan, Satyen Kale
