Sciweavers

164

ECML
2005
Springer

105views Machine Learning» more ECML 2005»

Multi-armed Bandit Algorithms and Empirical Evaluation

16 years 4 days ago

The multi-armed bandit problem for a gambler is to decide which arm of a K-slot machine to pull to maximize his total reward in a series of trials. Many real-world learning and opt...

Joannès Vermorel, Mehryar Mohri

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers