Search Sciweavers | Sciweavers

46 search results - page 5 / 10

» Delayed Nondeterminism in Continuous-Time Markov Decision Pr...


AIPS
2008

111 views, Artificial Intelligence

Multiagent Planning Under Uncertainty with Stochastic Communication Delays

15 years 8 months ago

Download www.aaai.org

We consider the problem of cooperative multiagent planning under uncertainty, formalized as a decentralized partially observable Markov decision process (Dec-POMDP). Unfortunately...

Matthijs T. J. Spaan, Frans A. Oliehoek, Nikos A. ...



GLOBECOM
2006
IEEE

99 views, Communications

Optimal Routing Between Alternate Paths With Different Network Transit Delays

16 years 1 day ago

Download www.cs.ucr.edu

We consider the path-determination problem in Internet core routers that distribute flows across alternate paths leading to the same destination. We assume that the remainder ...

Essia Hamouda Elhafsi, Mart Molle



QEST
2010
IEEE

139 views, Modeling and Simulation

Reasoning about MDPs as Transformers of Probability Distributions

15 years 3 months ago

Download osl.cs.uiuc.edu

We consider Markov Decision Processes (MDPs) as transformers on probability distributions, where with respect to a scheduler that resolves nondeterminism, the MDP can be seen as ex...

Vijay Anand Korthikanti, Mahesh Viswanathan, Gul A...



ICML
2006
IEEE

131 views, Machine Learning

PAC model-free reinforcement learning

16 years 6 months ago

Download cseweb.ucsd.edu

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm, Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...



AIPS
2006

211 views, Artificial Intelligence

Solving Factored MDPs with Exponential-Family Transition Models

15 years 7 months ago

Download www.cs.pitt.edu

Markov decision processes (MDPs) with discrete and continuous state and action components can be solved efficiently by hybrid approximate linear programming (HALP). The main idea ...

Branislav Kveton, Milos Hauskrecht

