Search Sciweavers | Sciweavers

COLT
2007
Springer

143 views, Machine Learning

Bounded Parameter Markov Decision Processes with Average Reward Criterion

15 years 9 months ago

Download ttic.uchicago.edu

Bounded parameter Markov Decision Processes (BMDPs) address the issue of dealing with uncertainty in the parameters of a Markov Decision Process (MDP). Unlike the case of an MDP, t...

Ambuj Tewari, Peter L. Bartlett

MOR
2007

109 views

Solution and Forecast Horizons for Infinite-Horizon Nonhomogeneous Markov Decision Processes

15 years 2 months ago

Download www-personal.umich.edu

We consider the problem of solving a nonhomogeneous infinite horizon Markov Decision Process (MDP) problem in the general case of potentially multiple optimal first period polic...

Torpong Cheevaprawatdomrong, Irwin E. Schochetman,...

IJCAI
2007

194 views, Artificial Intelligence

Average-Reward Decentralized Markov Decision Processes

15 years 4 months ago

Download anytime.cs.umass.edu

Formal analysis of decentralized decision making has become a thriving research area in recent years, producing a number of multi-agent extensions of Markov decision processes. Wh...

Marek Petrik, Shlomo Zilberstein

ICML
2001
IEEE

145 views, Machine Learning

Symmetry in Markov Decision Processes and its Implications for Single Agent and Multiagent Learning

16 years 4 months ago

Download www-2.cs.cmu.edu

This paper examines the notion of symmetry in Markov decision processes (MDPs). We define symmetry for an MDP and show how it can be exploited for more effective learning in singl...

Martin Zinkevich, Tucker R. Balch

AAAI
2004

117 views, Intelligent Agents

Solving Concurrent Markov Decision Processes

15 years 4 months ago

Download www.cs.washington.edu

Typically, Markov decision problems (MDPs) assume a single action is executed per decision epoch, but in the real world one may frequently execute certain actions in parallel. Thi...

Mausam, Daniel S. Weld
