Sciweavers


CORR
2008
Springer



We consider bandit problems involving a large (possibly infinite) collection of arms, in which the expected reward of each arm is a linear function of an r-dimensional random vect...

Paat Rusmevichientong, John N. Tsitsiklis
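
As a rough illustration of the setting the abstract describes, the sketch below models each arm as a vector in R^r whose expected reward is its inner product with a hidden r-dimensional random vector. The names `expected_reward`, `pull`, and `arms`, the three-arm set, and the Gaussian noise model are assumptions made for this example; none of them comes from the paper, and no algorithm from the paper is reproduced here.

```python
import random


def expected_reward(arm, z):
    """Expected reward of an arm: a linear function of the r-dimensional vector z."""
    return sum(u_i * z_i for u_i, z_i in zip(arm, z))


def pull(arm, z, rng, noise_std=0.1):
    """Observed reward: linear expected reward plus zero-mean Gaussian noise (assumed)."""
    return expected_reward(arm, z) + rng.gauss(0.0, noise_std)


rng = random.Random(0)
r = 2
# Hidden random vector; the learner does not observe it directly.
z = [rng.gauss(0.0, 1.0) for _ in range(r)]
# Hypothetical finite arm set in R^r (the abstract allows infinitely many arms).
arms = [[1.0, 0.0], [0.0, 1.0], [0.6, 0.8]]
# The arm an oracle that knew z would choose.
best_arm = max(arms, key=lambda u: expected_reward(u, z))
```

Because every arm shares the same hidden vector, a reward observed from one arm carries information about all the others, which is what makes a large or infinite arm set tractable in this model.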
