Search Sciweavers | Sciweavers

» Solving Factored MDPs with Exponential-Family Transition Mod...

NIPS
2007

Optimistic Linear Programming gives Logarithmic Regret for Irreducible MDPs

15 years 8 months ago

Download books.nips.cc

We present an algorithm called Optimistic Linear Programming (OLP) for learning to optimize average reward in an irreducible but otherwise unknown Markov decision process (MDP). O...

Ambuj Tewari, Peter L. Bartlett

IAT
2009
IEEE

Offline Planning for Communication by Exploiting Structured Interactions in Decentralized MDPs

15 years 11 months ago

Download mas.cs.umass.edu

Variants of the decentralized MDP model focus on problems exhibiting some special structure that makes them easier to solve in practice. Our work is concerned with two main issues...

Hala Mostafa, Victor R. Lesser

AI
2000
Springer

Stochastic dynamic programming with factored representations

15 years 6 months ago

Download www.cs.tufts.edu

Markov decision processes (MDPs) have proven to be popular models for decision-theoretic planning, but standard dynamic programming algorithms for solving MDPs rely on explicit, stat...

Craig Boutilier, Richard Dearden, Moisés Go...

AAAI
1997

Model Minimization in Markov Decision Processes

15 years 8 months ago

Download www.cs.brown.edu

Many stochastic planning problems can be represented using Markov Decision Processes (MDPs). A difficulty with using these MDP representations is that the common algorithms for so...

Thomas Dean, Robert Givan

ATAL
2008
Springer

Reinforcement learning for DEC-MDPs with changing action sets and partially ordered dependencies

15 years 9 months ago

Download ml.informatik.uni-freiburg.de

Decentralized Markov decision processes are frequently used to model cooperative multi-agent systems. In this paper, we identify a subclass of general DEC-MDPs that features regul...

Thomas Gabel, Martin A. Riedmiller
