Sciweavers

CORR
2012
Springer

Towards minimax policies for online linear optimization with bandit feedback

14 years 2 months ago

We address the online linear optimization problem with bandit feedback. Our contribution is twofold. First, we provide an algorithm (based on exponential weights) with a regret of...

Sébastien Bubeck, Nicolò Cesa-Bianch...
