IJCAI
2007

Average-Reward Decentralized Markov Decision Processes

Formal analysis of decentralized decision making has become a thriving research area in recent years, producing a number of multi-agent extensions of Markov decision processes. While much of the work has focused on optimizing discounted cumulative reward, optimizing average reward is sometimes a more suitable criterion. We formalize a class of such problems and analyze its characteristics, showing that it is NP-complete and that optimal policies are deterministic. Our analysis lays the foundation for designing two optimal algorithms. Experimental results with a standard problem from the literature illustrate the applicability of these solution techniques.

Marek Petrik, Shlomo Zilberstein

Related Content

» Bounded Parameter Markov Decision Processes with Average Reward Criterion

» Pseudometrics for State Aggregation in Average Reward Markov Decision Processes

» Complexity of Probabilistic Planning under Average Rewards

» Adaptive Stepsize Policy Gradients with Average Reward Metric

» Continuous-Time Hierarchical Reinforcement Learning

» On step sizes stochastic shortest paths and survival probabilities in Reinforcement Learni...

» Transition-independent decentralized Markov decision processes

» The Complexity of Decentralized Control of Markov Decision Processes

» A dynamic programming algorithm for decentralized Markov decision processes with a broadca...

Post Info
Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2007
Where	IJCAI
Authors	Marek Petrik, Shlomo Zilberstein