Sciweavers

CORR
2010
Springer

Combinatorial Network Optimization with Unknown Variables: Multi-Armed Bandits with Linear Rewards

15 years 1 month ago

In the classic multi-armed bandits problem, the goal is to have a policy for dynamically operating arms that each yield stochastic rewards with unknown means. The key metric of int...

Yi Gai, Bhaskar Krishnamachari, Rahul Jain
