Search Sciweavers | Sciweavers

424 search results - page 24 / 85

» Boosted sampling: approximation algorithms for stochastic op...

ICML
2009
IEEE

Model-free reinforcement learning as mixture learning

We cast model-free reinforcement learning as the problem of maximizing the likelihood of a probabilistic mixture model via sampling, addressing both the infinite and finite horizo...

Nikos Vlassis, Marc Toussaint

IOR
2010

Stochastic Root Finding and Efficient Estimation of Convex Risk Measures

Reliable risk measurement is a key problem for financial institutions and regulatory authorities. The current industry standard Value-at-Risk has several deficiencies. Improved ri...

Jörn Dunkel, Stefan Weber

AUSAI
2004
Springer

An ACO Algorithm for the Most Probable Explanation Problem

We describe an Ant Colony Optimization (ACO) algorithm, ANT-MPE, for the most probable explanation problem in Bayesian network inference. After tuning its parameter settings, we c...

Haipeng Guo, Prashanth R. Boddhireddy, William H. ...

ENVSOFT
2007

Optimal groundwater monitoring design using an ant colony optimization paradigm

Groundwater long-term monitoring (LTM) is required to assess the performance of groundwater remediation and the human health risk at post-closure sites where groundwater contami...

Yuanhai Li, Amy B. Chan Hilton

CDC
2010
IEEE

Numerical methods for the optimization of nonlinear stochastic delay systems, and an application to internet regulation

The Markov chain approximation method is an effective and widely used approach for computing optimal values and controls for stochastic systems. It was extended to nonlinear (and p...

Harold J. Kushner
