Markov Decision | Sciweavers

55

CDC
2010
IEEE

141views Control Systems» more CDC 2010»

A dynamic programming algorithm for decentralized Markov decision processes with a broadcast structure

13 years 10 months ago

We give an optimal dynamic programming algorithm to solve a class of finite-horizon decentralized Markov decision processes (MDPs). We consider problems with a broadcast informati...

Jeff Wu, Sanjay Lall

claim paper

Read More »

83

click to vote

QEST
2010
IEEE

139views Modeling and Simulation» more QEST 2010»

Reasoning about MDPs as Transformers of Probability Distributions

14 years 1 months ago

Download osl.cs.uiuc.edu

We consider Markov Decision Processes (MDPs) as transformers on probability distributions, where with respect to a scheduler that resolves nondeterminism, the MDP can be seen as ex...

Vijay Anand Korthikanti, Mahesh Viswanathan, Gul A...

claim paper

Read More »

37

click to vote

FMSD
2010

77views more FMSD 2010»

A game-based abstraction-refinement framework for Markov decision processes

14 years 1 months ago

Download www.prismmodelchecker.org

ASED ABSTRACTION-REFINEMENT FRAMEWORK FOR MARKOV DECISION PROCESSES Mark Kattenbelt Marta Kwiatkowska Gethin Norman David Parker CL-RR-08-06 Oxford University Computing Laborator...

Mark Kattenbelt, Marta Z. Kwiatkowska, Gethin Norm...

claim paper

Read More »

50

click to vote

CORR
2010
Springer

105views Education» more CORR 2010»

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

14 years 1 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in ﬁnite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

claim paper

Read More »

53

click to vote

JAIR
2002

120views more JAIR 2002»

Learning Geometrically-Constrained Hidden Markov Models for Robot Navigation: Bridging the Topological-Geometrical Gap

14 years 2 months ago

Download www.jair.org

Hidden Markov models hmms and partially observable Markov decision processes pomdps provide useful tools for modeling dynamical systems. They are particularly useful for represent...

Hagit Shatkay, Leslie Pack Kaelbling

claim paper

Read More »

58

click to vote

ENTCS
2006

134views more ENTCS 2006»

Partial Order Reduction for Probabilistic Branching Time

14 years 3 months ago

Download www.win.tue.nl

In the past, partial order reduction has been used successfully to combat the state explosion problem in the context of model checking for non-probabilistic systems. For both line...

Christel Baier, Pedro R. D'Argenio, Marcus Grö...

claim paper

Read More »

49

click to vote

CORR
2006
Springer

113views Education» more CORR 2006»

A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD

14 years 3 months ago

Download hal.inria.fr

This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(), LSTD()...

Manuel Loth, Philippe Preux

claim paper

Read More »

55

click to vote

CORR
2008
Springer

154views Education» more CORR 2008»

A Counterexample Guided Abstraction-Refinement Framework for Markov Decision Processes

14 years 3 months ago

Download tocl.acm.org

rexample Guided Abstraction-Refinement Framework for Markov Decision Processes ROHIT CHADHA and MAHESH VISWANATHAN Dept. of Computer Science, University of Illinois at Urbana-Champ...

Rohit Chadha, Mahesh Viswanathan

claim paper

Read More »

40

click to vote

CORR
2008
Springer

91views Education» more CORR 2008»

Significant Diagnostic Counterexamples in Probabilistic Model Checking

14 years 3 months ago

Download www.cs.ru.nl

Abstract. This paper presents a novel technique for counterexample generation in probabilistic model checking of Markov chains and Markov Decision Processes. (Finite) paths in coun...

Miguel E. Andrés, Pedro R. D'Argenio, Peter...

claim paper

Read More »

34

click to vote

CORR
2010
Springer

106views Education» more CORR 2010»

MDPs with Unawareness

14 years 3 months ago

Download www.cs.cornell.edu

Markov decision processes (MDPs) are widely used for modeling decision-making problems in robotics, automated control, and economics. Traditional MDPs assume that the decision mak...

Joseph Y. Halpern, Nan Rong, Ashutosh Saxena

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers