Search Sciweavers | Sciweavers

263 search results - page 2 / 53

» Regret Bounds for Prediction Problems

245

click to vote

JMLR
2012

200views Programming Languages» more JMLR 2012»

Contextual Bandit Learning with Predictable Rewards

13 years 10 months ago

Download www.cs.princeton.edu

Contextual bandit learning is a reinforcement learning problem where the learner repeatedly receives a set of features (context), takes an action and receives a reward based on th...

Alekh Agarwal, Miroslav Dudík, Satyen Kale,...

claim paper

Read More »

222

click to vote

COCOON
2006
Springer

121views Combinatorics» more COCOON 2006»

Approximating Min-Max (Regret) Versions of Some Polynomial Problems

15 years 11 months ago

Download www.lamsade.dauphine.fr

Abstract. While the complexity of min-max and min-max regret versions of most classical combinatorial optimization problems has been thoroughly investigated, there are very few stu...

Hassene Aissi, Cristina Bazgan, Daniel Vanderpoote...

claim paper

Read More »

162

click to vote

ALT
2007
Springer

134views Machine Learning» more ALT 2007»

Tuning Bandit Algorithms in Stochastic Environments

16 years 4 months ago

Download www.sztaki.hu

Algorithms based on upper-conﬁdence bounds for balancing exploration and exploitation are gaining popularity since they are easy to implement, eﬃcient and eﬀective. In this p...

Jean-Yves Audibert, Rémi Munos, Csaba Szepe...

claim paper

Read More »

220

click to vote

JMLR
2010

125views more JMLR 2010»

Regret Bounds for Gaussian Process Bandit Problems

15 years 2 months ago

Download jmlr.csail.mit.edu

Bandit algorithms are concerned with trading exploration with exploitation where a number of options are available but we can only learn their quality by experimenting with them. ...

Steffen Grünewälder, Jean-Yves Audibert,...

claim paper

Read More »

213

click to vote

ECCC
2010

80views more ECCC 2010»

Regret Minimization for Online Buffering Problems Using the Weighted Majority Algorithm

15 years 7 months ago

Download www.colt2010.org

Suppose a decision maker has to purchase a commodity over time with varying prices and demands. In particular, the price per unit might depend on the amount purchased and this pri...

Melanie Winkler, Berthold Vöcking, Sascha Geu...

claim paper

Read More »

« Prev « First page 2 / 53 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers