Search Sciweavers | Sciweavers

32 search results - page 5 / 7

» Optimal and Approximate Q-value Functions for Decentralized ...

IAT
2005
IEEE

132 views | Intelligent Agents

Decomposing Large-Scale POMDP Via Belief State Analysis

14 years 1 month ago

Download www.comp.hkbu.edu.hk

Partially observable Markov decision process (POMDP) is commonly used to model a stochastic environment with unobservable states for supporting optimal decision making. Computing ...

Xin Li, William K. Cheung, Jiming Liu

HICSS
2003
IEEE

207 views | Biometrics

Formalizing Multi-Agent POMDP's in the context of network routing

14 years 28 days ago

Download www.hicss.hawaii.edu

This paper uses partially observable Markov decision processes (POMDP’s) as a basic framework for MultiAgent planning. We distinguish three perspectives: first one is that of a...

Bharaneedharan Rathnasabapathy, Piotr J. Gmytrasie...

ATAL
2003
Springer

152 views | Intelligent Agents

Transition-independent decentralized Markov decision processes

14 years 27 days ago

Download anytime.cs.umass.edu

There has been substantial progress with formal models for sequential decision making by individual agents using the Markov decision process (MDP). However, similar treatment of m...

Raphen Becker, Shlomo Zilberstein, Victor R. Lesse...

CORR
2006
Springer

130 views | Education

On optimal quantization rules for some sequential decision problems

13 years 7 months ago

Download www.stat.berkeley.edu

We consider the problem of sequential decentralized detection, a problem that entails several interdependent choices: the choice of a stopping rule (specifying the sample size), a...

XuanLong Nguyen, Martin J. Wainwright, Michael I. ...

AAAI
2010

136 views | Intelligent Agents

Robust Policy Computation in Reward-Uncertain MDPs Using Nondominated Policies

13 years 9 months ago

Download www.cs.toronto.edu

The precise specification of reward functions for Markov decision processes (MDPs) is often extremely difficult, motivating research into both reward elicitation and the robust so...

Kevin Regan, Craig Boutilier
