Search Sciweavers | Sciweavers

66 search results - page 5 / 14

» The Nonstochastic Multiarmed Bandit Problem

ML
2002
ACM

Finite-time Analysis of the Multiarmed Bandit Problem

15 years 6 months ago

Download homes.dsi.unimi.it

Reinforcement learning policies face the exploration versus exploitation dilemma, i.e. the search for a balance between exploring the environment to find profitable actions while t...

Peter Auer, Nicolò Cesa-Bianchi, Paul Fisch...

COLT
2004
Springer

The Budgeted Multi-armed Bandit Problem

15 years 10 months ago

Download omadani.net

...abstraction of the following scenarios: choosing from among a set of alternative treatments after a fixed number of clinical trials, determining the best parameter settings for a pro...

Omid Madani, Daniel J. Lizotte, Russell Greiner

ALT
2011
Springer

Deviations of Stochastic Bandit Regret

14 years 7 months ago

Download certis.enpc.fr

This paper studies the deviations of the regret in a stochastic multi-armed bandit problem. When the total number of plays n is known beforehand by the agent, Audibert et al. (2009...

Antoine Salomon, Jean-Yves Audibert

CORR
2011
Springer

Online Least Squares Estimation with Self-Normalized Processes: An Application to Bandit Problems

15 years 2 months ago

Download www.ualberta.ca

The analysis of online least squares estimation is at the heart of many stochastic sequential decision-making problems. We employ tools from the self-normalized processes to provi...

Yasin Abbasi-Yadkori, Dávid Pál, Csa...

FSTTCS
2010
Springer

Playing in stochastic environment: from multi-armed bandits to two-player games

15 years 5 months ago

Download drops.dagstuhl.de

Given a zero-sum infinite game we examine the question if players have optimal memoryless deterministic strategies. It turns out that under some general conditions the problem for...

Wieslaw Zielonka
