Search Sciweavers | Sciweavers

271 search results - page 55 / 55

» Identifying Optimal Sequential Decisions

280

click to vote

AMAI
2011
Springer

273views Artificial Intelligence» more AMAI 2011»

Multi-armed bandits with episode context

14 years 7 months ago

Download gauss.ececs.uc.edu

A multi-armed bandit episode consists of n trials, each allowing selection of one of K arms, resulting in payoff from a distribution over [0, 1] associated with that arm. We assum...

Christopher D. Rosin

claim paper

Read More »

« Prev « First page 55 / 55 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers