Sciweavers


ICML 2006, IEEE


Experience-efficient learning in associative bandit problems

16 years 7 months ago

We formalize the associative bandit problem framework introduced by Kaelbling as a learning-theory problem. The learning environment is modeled as a k-armed bandit where arm payof...

Alexander L. Strehl, Chris Mesterharm, Michael L. ...

