Search Sciweavers | Sciweavers

» The Online Specialization Problem

AAAI
1998

Applying Online Search Techniques to Continuous-State Reinforcement Learning

In this paper, we describe methods for efficiently computing better solutions to control problems in continuous state spaces. We provide algorithms that exploit online search to boo...

Scott Davies, Andrew Y. Ng, Andrew W. Moore

SIAMSC
2010

New Algorithms for Optimal Online Checkpointing

Frequently, the computation of derivatives for optimizing time-dependent problems is based on the integration of the adjoint differential equation. For this purpose, the knowledge...

Philipp Stumm, Andrea Walther

ANOR
2010

Online stochastic optimization under time constraints

This paper considers online stochastic optimization problems where uncertainties are characterized by a distribution that can be sampled and where time constraints severely limit t...

Pascal Van Hentenryck, Russell Bent, Eli Upfal

JMLR
2010

Dual Averaging Methods for Regularized Stochastic Learning and Online Optimization

We consider regularized stochastic learning and online optimization problems, where the objective function is the sum of two convex terms: one is the loss function of the learning...

Lin Xiao

KDD
2012
ACM

Online allocation of display ads with smooth delivery

Display ads on the Internet are often sold in bundles of thousands or millions of impressions over a particular time period, typically weeks or months. Ad serving systems that ass...

Anand Bhalgat, Jon Feldman, Vahab S. Mirrokni
