Search Sciweavers | Sciweavers

403 search results - page 29 / 81

» Sampling Bounds for Stochastic Optimization

169

click to vote

COLT
2008
Springer

140views Machine Learning» more COLT 2008»

Regret Bounds for Sleeping Experts and Bandits

15 years 9 months ago

Download colt2008.cs.helsinki.fi

We study on-line decision problems where the set of actions that are available to the decision algorithm vary over time. With a few notable exceptions, such problems remained larg...

Robert D. Kleinberg, Alexandru Niculescu-Mizil, Yo...

claim paper

Read More »

200

click to vote

IPCO
2010

125views Optimization» more IPCO 2010»

A Pumping Algorithm for Ergodic Stochastic Mean Payoff Games with Perfect Information

15 years 8 months ago

Download www.mpi-inf.mpg.de

Abstract. We consider two-person zero-sum stochastic mean payoff games with perfect information, or BWR-games, given by a digraph G = (V = VB VW VR, E), with local rewards r : E R...

Endre Boros, Khaled M. Elbassioni, Vladimir Gurvic...

claim paper

Read More »

241

click to vote

AAAI
2012

198views Intelligent Agents» more AAAI 2012»

A Search Algorithm for Latent Variable Models with Unbounded Domains

13 years 9 months ago

Download www.cs.ubc.ca

This paper concerns learning and prediction with probabilistic models where the domain sizes of latent variables have no a priori upper-bound. Current approaches represent prior d...

Michael Chiang, David Poole

claim paper

Read More »

181

Voted

AAAI
1998

100views Intelligent Agents» more AAAI 1998»

Branch and Bound Algorithm Selection by Performance Prediction

15 years 8 months ago

Download www.aaai.org

Wepropose a method called Selection by Performance Prediction (SPP) which allows one, when faced with a particular problem instance, to select a Branch and Boundalgorithm from amo...

Lionel Lobjois, Michel Lemaître

claim paper

Read More »

254

click to vote

ICML
2010
IEEE

204views Machine Learning» more ICML 2010»

Gaussian Process Optimization in the Bandit Setting: No Regret and Experimental Design

15 years 8 months ago

Download www.its.caltech.edu

Many applications require optimizing an unknown, noisy function that is expensive to evaluate. We formalize this task as a multiarmed bandit problem, where the payoff function is ...

Niranjan Srinivas, Andreas Krause, Sham Kakade, Ma...

claim paper

Read More »

« Prev « First page 29 / 81 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers