Sciweavers

157

CORR
2006
Springer

83views Education» more CORR 2006»

How to Beat the Adaptive Multi-Armed Bandit

15 years 6 months ago

The multi-armed bandit is a concise model for the problem of iterated decision-making under uncertainty. In each round, a gambler must pull one of K arms of a slot machine, withou...

Varsha Dani, Thomas P. Hayes

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers