Resource Allocation Among Agents with MDP-Induced Preferences

13 years 11 months ago

Download www.jair.org

Allocating scarce resources among agents to maximize global utility is, in general, computationally challenging. We focus on problems where resources enable agents to execute actions in stochastic environments, modeled as Markov decision processes (MDPs), such that the value of a resource bundle is defined as the expected value of the optimal MDP policy realizable given these resources. We present an algorithm that simultaneously solves the resource-allocation and the policy-optimization problems. This allows us to avoid explicitly representing utilities over exponentially many resource bundles, leading to drastic (often exponential) reductions in computational complexity. We then use this algorithm in the context of self-interested agents to design a combinatorial auction for allocating resources. We empirically demonstrate the effectiveness of our approach by showing that it can, in minutes, optimally solve problems for which a straightforward combinatorial resource-allocation techn...

Dmitri A. Dolgov, Edmund H. Durfee

Real-time Traffic

JAIR 2006 | Optimal Mdp Policy | Resource | Resource Bundles |

claim paper

Post Info
More Details (n/a)

Added	13 Dec 2010
Updated	13 Dec 2010
Type	Journal
Year	2006
Where	JAIR
Authors	Dmitri A. Dolgov, Edmund H. Durfee

Comments (0)

Sciweavers

Resource Allocation Among Agents with MDP-Induced Preferences

JAIR 2006 | Optimal Mdp Policy | Resource | Resource Bundles |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers