Search Sciweavers | Sciweavers

201 search results - page 37 / 41

» Solving Concurrent Markov Decision Processes

177

click to vote

NIPS
2008

109views Information Technology» more NIPS 2008»

Biasing Approximate Dynamic Programming with a Lower Discount Factor

15 years 8 months ago

Download hal.inria.fr

Most algorithms for solving Markov decision processes rely on a discount factor, which ensures their convergence. It is generally assumed that using an artificially low discount f...

Marek Petrik, Bruno Scherrer

claim paper

Read More »

164

click to vote

NIPS
2007

146views Information Technology» more NIPS 2007»

Optimistic Linear Programming gives Logarithmic Regret for Irreducible MDPs

15 years 8 months ago

Download books.nips.cc

We present an algorithm called Optimistic Linear Programming (OLP) for learning to optimize average reward in an irreducible but otherwise unknown Markov decision process (MDP). O...

Ambuj Tewari, Peter L. Bartlett

claim paper

Read More »

175

click to vote

NIPS
2001

158views Information Technology» more NIPS 2001»

Multiagent Planning with Factored MDPs

15 years 8 months ago

Download books.nips.cc

We present a principled and efficient planning algorithm for cooperative multiagent dynamic systems. A striking feature of our method is that the coordination and communication be...

Carlos Guestrin, Daphne Koller, Ronald Parr

claim paper

Read More »

168

click to vote

IJCAI
2003

137views Artificial Intelligence» more IJCAI 2003»

Approximating Optimal Policies for Agents with Limited Execution Resources

15 years 8 months ago

Download ai.stanford.edu

An agent with limited consumable execution resources needs policies that attempt to achieve good performance while respecting these limitations. Otherwise, an agent (such as a pla...

Dmitri A. Dolgov, Edmund H. Durfee

claim paper

Read More »

182

click to vote

JMLR
2006

190views more JMLR 2006»

Causal Graph Based Decomposition of Factored MDPs

15 years 6 months ago

Download www-anw.cs.umass.edu

We present Variable Influence Structure Analysis, or VISA, an algorithm that performs hierarchical decomposition of factored Markov decision processes. VISA uses a dynamic Bayesia...

Anders Jonsson, Andrew G. Barto

claim paper

Read More »

« Prev « First page 37 / 41 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers