Search Sciweavers | Sciweavers

81 search results - page 6 / 17

» The Sample Average Approximation Method for Stochastic Discr...

ECAI
2010
Springer

232 views | Artificial Intelligence

EP for Efficient Stochastic Control with Obstacles

15 years 7 months ago

Download www.snn.ru.nl

Abstract. We address the problem of continuous stochastic optimal control in the presence of hard obstacles. Due to the non-smooth character of the obstacles, the traditional appro...

Thomas Mensink, Jakob J. Verbeek, Bert Kappen

AAAI
2012

202 views | Intelligent Agents

Lagrangian Relaxation Techniques for Scalable Spatial Conservation Planning

13 years 9 months ago

Download rbr.cs.umass.edu

We address the problem of spatial conservation planning in which the goal is to maximize the expected spread of cascades of an endangered species by strategically purchasing land ...

Akshat Kumar, XiaoJian Wu, Shlomo Zilberstein

CDC
2010
IEEE

160 views | Control Systems

Aggregation-based model reduction of a Hidden Markov Model

15 years 1 month ago

Download mechse.illinois.edu

This paper is concerned with developing an information-theoretic framework to aggregate the state space of a Hidden Markov Model (HMM) on discrete state and observation spaces. The...

Kun Deng, Prashant G. Mehta, Sean P. Meyn

ICIP
2005
IEEE

151 views | Image Processing

Beyond interpolation: optimal reconstruction by quasi-interpolation

16 years 8 months ago

Download www.ee.cuhk.edu.hk

We investigate the use of quasi-interpolating approximation schemes, to construct an estimate of an unknown function from its given discrete samples. We show theoretically and wit...

Laurent Condat, Thierry Blu, Michael Unser

NIPS
2001

144 views | Information Technology

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 8 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...
