Search Sciweavers | Sciweavers

125 search results - page 20 / 25

» The Stochastic Machine Replenishment Problem

ICML
1994
IEEE

Learning Without State-Estimation in Partially Observable Markovian Decision Processes

15 years 10 months ago

Download www.eecs.umich.edu

Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning control architectures for embedded agents. Unfortunately all of the theory and much ...

Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...

ICML
2010
IEEE

Learning Efficiently with Approximate Inference via Dual Losses

15 years 7 months ago

Download www.cs.huji.ac.il

Many structured prediction tasks involve complex models where inference is computationally intractable, but where it can be well approximated using a linear programming relaxation...

Ofer Meshi, David Sontag, Tommi Jaakkola, Amir Glo...

ML
2002
ACM

A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision Processes

15 years 6 months ago

Download www.cis.upenn.edu

An issue that is critical for the application of Markov decision processes (MDPs) to realistic problems is how the complexity of planning scales with the size of the MDP. In stochas...

Michael J. Kearns, Yishay Mansour, Andrew Y. Ng

COLT
2010
Springer

Best Arm Identification in Multi-Armed Bandits

15 years 4 months ago

Download www.di.ens.fr

We consider the problem of finding the best arm in a stochastic multi-armed bandit game. The regret of a forecaster is here defined by the gap between the mean reward of the optim...

Jean-Yves Audibert, Sébastien Bubeck, Ré...

SC
1995
ACM

Distributing a Chemical Process Optimization Application Over a Gigabit Network

15 years 10 months ago

Download www.chg.ru

We evaluate the impact of a gigabit network on the implementation of a distributed chemical process optimization application. The optimization problem is formulated as a stochasti...

Robert L. Clay, Peter Steenkiste
