Search Sciweavers | Sciweavers

125 search results - page 6 / 25

» The Stochastic Machine Replenishment Problem

155

click to vote

ALT
2011
Springer

259views Machine Learning» more ALT 2011»

Deviations of Stochastic Bandit Regret

14 years 2 months ago

Download certis.enpc.fr

This paper studies the deviations of the regret in a stochastic multi-armed bandit problem. When the total number of plays n is known beforehand by the agent, Audibert et al. (2009...

Antoine Salomon, Jean-Yves Audibert

claim paper

Read More »

140

click to vote

ML
2000
ACM

103views Machine Learning» more ML 2000»

Nonparametric Time Series Prediction Through Adaptive Model Selection

15 years 2 months ago

Download webee.technion.ac.il

We consider the problem of one-step ahead prediction for time series generated by an underlying stationary stochastic process obeying the condition of absolute regularity, describi...

Ron Meir

claim paper

Read More »

106

click to vote

ALT
2009
Springer

128views Machine Learning» more ALT 2009»

Pure Exploration in Multi-armed Bandits Problems

15 years 11 months ago

Download sequel.futurs.inria.fr

Abstract. We consider the framework of stochastic multi-armed bandit problems and study the possibilities and limitations of strategies that explore sequentially the arms. The stra...

Sébastien Bubeck, Rémi Munos, Gilles...

claim paper

Read More »

103

click to vote

SDM
2010
SIAM

151views Data Mining» more SDM 2010»

Fast Stochastic Frank-Wolfe Algorithms for Nonlinear SVMs

15 years 4 months ago

Download www.cc.gatech.edu

The high computational cost of nonlinear support vector machines has limited their usability for large-scale problems. We propose two novel stochastic algorithms to tackle this pr...

Hua Ouyang, Alexander Gray

claim paper

Read More »

116

click to vote

ICML
2009
IEEE

172views Machine Learning» more ICML 2009»

Model-free reinforcement learning as mixture learning

16 years 3 months ago

Download user.cs.tu-berlin.de

We cast model-free reinforcement learning as the problem of maximizing the likelihood of a probabilistic mixture model via sampling, addressing both the infinite and finite horizo...

Nikos Vlassis, Marc Toussaint

claim paper

Read More »

« Prev « First page 6 / 25 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers