Solving Very Large Weakly Coupled Markov Decision Processes

15 years 3 months ago

Download www.cs.toronto.edu

We present a technique for computing approximately optimal solutions to stochastic resource allocation problems modeled as Markov decision processes (MDPs). We exploit two key properties to avoid explicitly enumerating the very large state and action spaces associated with these problems. First, the problems are composed of multiple tasks whose utilities are independent. Second, the actions taken with respect to (or resources allocated to) a task do not influence the status of any other task. We can therefore view each task as an MDP. However, these MDPs are weakly coupled by resource constraints: actions selected for one MDP restrict the actions available to others. We describe heuristic techniques for dealing with several classes of constraints that use the solutions for individual MDPsto construct an approximate global solution. We demonstrate this techniqueon problems involving thousandsof tasks, approximating the solution to problems that are far beyondthe reach of standard metho...

Nicolas Meuleau, Milos Hauskrecht, Kee-Eung Kim, L

Real-time Traffic

AAAI 1998 | Intelligent Agents | Markov Decision | Optimal Solutions | Resource Allocation Problems |

claim paper

» Model Minimization in Markov Decision Processes

» Building efficient partial plans using Markov decision processes

» VeriML typed computation of logical terms inside a language with effects

» Symmetric approximate linear programming for factored MDPs with application to constrained...

» Dynamic Programming for Partially Observable Stochastic Games

» Decision Making in Uncertain RealWorld Domains Using DTGolog

» Planning in Factored Action Spaces with Symbolic Dynamic Programming

» On step sizes stochastic shortest paths and survival probabilities in Reinforcement Learni...

Post Info
More Details (n/a)

Added	01 Nov 2010
Updated	01 Nov 2010
Type	Conference
Year	1998
Where	AAAI
Authors	Nicolas Meuleau, Milos Hauskrecht, Kee-Eung Kim, Leonid Peshkin, Leslie Pack Kaelbling, Thomas Dean, Craig Boutilier

Comments (0)

Sciweavers

Solving Very Large Weakly Coupled Markov Decision Processes

AAAI 1998 | Intelligent Agents | Markov Decision | Optimal Solutions | Resource Allocation Problems |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers