Search Sciweavers | Sciweavers

2 search results - page 1 / 1

» Piecewise-stationary bandit problems with side observations

197

click to vote

ICML
2009
IEEE

109views Machine Learning» more ICML 2009»

Piecewise-stationary bandit problems with side observations

16 years 8 months ago

Download www.cim.mcgill.ca

We consider a sequential decision problem where the rewards are generated by a piecewise-stationary distribution. However, the different reward distributions are unknown and may c...

Jia Yuan Yu, Shie Mannor

claim paper

Read More »

175

click to vote

ICML
2008
IEEE

120views Machine Learning» more ICML 2008»

Exploration scavenging

16 years 8 months ago

Download hunch.net

We examine the problem of evaluating a policy in the contextual bandit setting using only observations collected during the execution of another policy. We show that policy evalua...

John Langford, Alexander L. Strehl, Jennifer Wortm...

claim paper

Read More »

« Prev « First page 1 / 1 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers