Bellman equation | Sciweavers

71

AIPS
2011

216views Artificial Intelligence» more AIPS 2011»

Heuristic Search for Generalized Stochastic Shortest Path MDPs

13 years 6 months ago

Research in efﬁcient methods for solving inﬁnite-horizon MDPs has so far concentrated primarily on discounted MDPs and the more general stochastic shortest path problems (SSPs...

Andrey Kolobov, Mausam, Daniel S. Weld, Hector Gef...

claim paper

Read More »

44

click to vote

CORR
2010
Springer

127views Education» more CORR 2010»

Mean field for Markov Decision Processes: from Discrete to Continuous Optimization

14 years 3 months ago

Download infoscience.epfl.ch

We study the convergence of Markov Decision Processes made of a large number of objects to optimization problems on ordinary differential equations (ODE). We show that the optimal...

Nicolas Gast, Bruno Gaujal, Jean-Yves Le Boudec

claim paper

Read More »

42

click to vote

ICML
2010
IEEE

189views Machine Learning» more ICML 2010»

Nonparametric Return Distribution Approximation for Reinforcement Learning

14 years 4 months ago

Download www.icml2010.org

Standard Reinforcement Learning (RL) aims to optimize decision-making rules in terms of the expected return. However, especially for risk-management purposes, other criteria such ...

Tetsuro Morimura, Masashi Sugiyama, Hisashi Kashim...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers