Search Sciweavers | Sciweavers

260 search results - page 26 / 52

» Quasi-Deterministic Partially Observable Markov Decision Pro...

140

click to vote

ATAL
2008
Springer

103views Intelligent Agents» more ATAL 2008»

The permutable POMDP: fast solutions to POMDPs for preference elicitation

15 years 7 months ago

Download mapleleaf.csail.mit.edu

The ability for an agent to reason under uncertainty is crucial for many planning applications, since an agent rarely has access to complete, error-free information about its envi...

Finale Doshi, Nicholas Roy

claim paper

Read More »

119

click to vote

PRIMA
2007
Springer

98views Intelligent Agents» more PRIMA 2007»

Multiagent Planning with Trembling-Hand Perfect Equilibrium in Multiagent POMDPs

15 years 11 months ago

Download lang.is.kyushu-u.ac.jp

Multiagent Partially Observable Markov Decision Processes are a popular model of multiagent systems with uncertainty. Since the computational cost for ﬁnding an optimal joint pol...

Yuichi Yabu, Makoto Yokoo, Atsushi Iwasaki

claim paper

Read More »

176

click to vote

ENTCS
2006

134views more ENTCS 2006»

Partial Order Reduction for Probabilistic Branching Time

15 years 5 months ago

Download www.win.tue.nl

In the past, partial order reduction has been used successfully to combat the state explosion problem in the context of model checking for non-probabilistic systems. For both line...

Christel Baier, Pedro R. D'Argenio, Marcus Grö...

claim paper

Read More »

198

click to vote

AI
2011
Springer

211views Artificial Intelligence» more AI 2011»

Decentralized MDPs with sparse interactions

14 years 9 months ago

Download www.inesc-id.pt

In this work, we explore how local interactions can simplify the process of decision-making in multiagent systems, particularly in multirobot problems. We review a recent decision-...

Francisco S. Melo, Manuela M. Veloso

claim paper

Read More »

117

click to vote

ICANN
2007
Springer

95views Neural Networks» more ICANN 2007»

Solving Deep Memory POMDPs with Recurrent Policy Gradients

15 years 11 months ago

Download www.idsia.ch

Abstract. This paper presents Recurrent Policy Gradients, a modelfree reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...

Daan Wierstra, Alexander Förster, Jan Peters,...

claim paper

Read More »

« Prev « First page 26 / 52 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers