Search Sciweavers | Sciweavers


» Regret Bounds for Prediction Problems


CORR
2007
Springer


Bandit Algorithms for Tree Search

13 years 9 months ago

Download hal.inria.fr

Bandit-based methods for tree search have recently gained popularity when applied to huge trees, e.g. in the game of Go [6]. Their efficient exploration of the tree enables to ret...

Pierre-Arnaud Coquelin, Rémi Munos


JMLR
2008


Online Learning of Complex Prediction Problems Using Simultaneous Projections

13 years 9 months ago

Download jmlr.csail.mit.edu

We describe and analyze an algorithmic framework for online classification where each online trial consists of multiple prediction tasks that are tied together. We tackle the prob...

Yonatan Amit, Shai Shalev-Shwartz, Yoram Singer


JMLR
2010


Efficient Reductions for Imitation Learning

13 years 4 months ago

Download www.cs.cmu.edu

Imitation Learning, while applied successfully on many large real-world problems, is typically addressed as a standard supervised learning problem, where it is assumed the trainin...

Stéphane Ross, Drew Bagnell


TSP
2010


Distributed Learning in Multi-Armed Bandit with Multiple Players

13 years 4 months ago

Download www.ece.ucdavis.edu

We formulate and study a decentralized multi-armed bandit (MAB) problem. There are distributed players competing for independent arms. Each arm, when played, offers i.i.d. reward a...

Keqin Liu, Qing Zhao


JMLR
2010


Dual Averaging Methods for Regularized Stochastic Learning and Online Optimization

13 years 4 months ago

Download jmlr.csail.mit.edu

We consider regularized stochastic learning and online optimization problems, where the objective function is the sum of two convex terms: one is the loss function of the learning...

Lin Xiao

