Search Sciweavers | Sciweavers

337 search results - page 7 / 68

» Mean-Variance Optimization in Markov Decision Processes

164

click to vote

CDC
2010
IEEE

141views Control Systems» more CDC 2010»

A dynamic programming algorithm for decentralized Markov decision processes with a broadcast structure

15 years 6 hour ago

Download junction.stanford.edu

We give an optimal dynamic programming algorithm to solve a class of finite-horizon decentralized Markov decision processes (MDPs). We consider problems with a broadcast informati...

Jeff Wu, Sanjay Lall

claim paper

Read More »

189

click to vote

AIPS
2011

233views Artificial Intelligence» more AIPS 2011»

Sample-Based Planning for Continuous Action Markov Decision Processes

14 years 8 months ago

Download www.chrismansley.com

In this paper, we present a new algorithm that integrates recent advances in solving continuous bandit problems with sample-based rollout methods for planning in Markov Decision P...

Christopher R. Mansley, Ari Weinstein, Michael L. ...

claim paper

Read More »

146

click to vote

AAAI
1998

129views Intelligent Agents» more AAAI 1998»

Solving Very Large Weakly Coupled Markov Decision Processes

15 years 6 months ago

Download www.cs.toronto.edu

We present a technique for computing approximately optimal solutions to stochastic resource allocation problems modeled as Markov decision processes (MDPs). We exploit two key pro...

Nicolas Meuleau, Milos Hauskrecht, Kee-Eung Kim, L...

claim paper

Read More »

128

click to vote

ALT
2008
Springer

141views Machine Learning» more ALT 2008»

Online Regret Bounds for Markov Decision Processes with Deterministic Transitions

16 years 1 months ago

Download personal.unileoben.ac.at

Abstract. We consider an upper conﬁdence bound algorithm for Markov decision processes (MDPs) with deterministic transitions. For this algorithm we derive upper bounds on the onl...

Ronald Ortner

claim paper

Read More »

140

click to vote

LICS
2007
IEEE

121views Automated Reasoning» more LICS 2007»

Limits of Multi-Discounted Markov Decision Processes

15 years 11 months ago

Download www.labri.fr

Markov decision processes (MDPs) are controllable discrete event systems with stochastic transitions. The payoff received by the controller can be evaluated in different ways, dep...

Hugo Gimbert, Wieslaw Zielonka

claim paper

Read More »

« Prev « First page 7 / 68 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers