Search Sciweavers | Sciweavers

1138 search results - page 72 / 228

» Feature Markov Decision Processes

NIPS
2001

158 views, Information Technology

Multiagent Planning with Factored MDPs

15 years 4 months ago

Download books.nips.cc

We present a principled and efficient planning algorithm for cooperative multiagent dynamic systems. A striking feature of our method is that the coordination and communication be...

Carlos Guestrin, Daphne Koller, Ronald Parr

CJ
2004

141 views

Modeling and Analysis of a Scheduled Maintenance System: a DSPN Approach

15 years 2 months ago

Download dcl.isti.cnr.it

This paper describes a way to manage the modeling and analysis of Scheduled Maintenance Systems (SMS) within an analytically tractable context. We chose a significant case study h...

Andrea Bondavalli, Roberto Filippini

ICML
2008
IEEE

135 views, Machine Learning

Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs

16 years 3 months ago

Download mapleleaf.csail.mit.edu

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...

Finale Doshi, Joelle Pineau, Nicholas Roy

ICALP
2009
Springer

92 views, Programming Languages

Reachability in Stochastic Timed Games

16 years 3 months ago

Download www.lsv.ens-cachan.fr

We define stochastic timed games, which extend two-player timed games with probabilities (following a recent approach by Baier et al.), and which extend in a natural way continuous-...

Patricia Bouyer, Vojtech Forejt

ECML
2007
Springer

108 views, Machine Learning

Safe Q-Learning on Complete History Spaces

15 years 9 months ago

Download www.ni.uos.de

In this article, we present an idea for solving deterministic partially observable Markov decision processes (POMDPs) based on a history space containing sequences of past observat...

Stephan Timmer, Martin Riedmiller
