Search Sciweavers | Sciweavers

COLT
2010
Springer

207views Machine Learning» more COLT 2010»

An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

15 years 16 days ago

Download www.colt2010.org

The multiarmed bandit problem is a typical example of the dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...

Junya Honda, Akimichi Takemura

CDC
2008
IEEE

97views Control Systems» more CDC 2008»

A monotonic algorithm for the optimal control of the Fokker-Planck equation

15 years 9 months ago

Download hal.archives-ouvertes.fr

Motivated by some crowd motion models in the presence of noise, we consider an optimal control problem governed by the Fokker-Planck equation. We sketch optimality conditions b...

Guillaume Carlier, Julien Salomon

CORR
2011
Springer

204views Education» more CORR 2011»

Accelerated Dual Descent for Network Optimization

14 years 9 months ago

Download web.mit.edu

Dual descent methods are commonly used to solve network optimization problems because their implementation can be distributed through the network. However, their convergence rat...

Michael Zargham, A. Ribeiro, Ali Jadbabaie, Asuman...

AAAI
2000

139views Intelligent Agents» more AAAI 2000»

Localizing Search in Reinforcement Learning

15 years 3 months ago

Download www.cs.colorado.edu

Reinforcement learning (RL) can be impractical for many high dimensional problems because of the computational cost of doing stochastic search in large state spaces. We propose a ...

Gregory Z. Grudic, Lyle H. Ungar

NLP
2000

174views Natural Language Processing» more NLP 2000»

Monte-Carlo Sampling for NP-Hard Maximization Problems in the Framework of Weighted Parsing

15 years 6 months ago

Download liawww.epfl.ch

Abstract. The purpose of this paper is (1) to provide a theoretical justification for the use of Monte-Carlo sampling for approximate resolution of NP-hard maximization problems in...

Jean-Cédric Chappelier, Martin Rajman
