Search Sciweavers | Sciweavers

683 search results - page 57 / 137

» Coarticulation in Markov Decision Processes

click to vote

ATAL
2011
Springer

169views Intelligent Agents» more ATAL 2011»

Towards a unifying characterization for quantifying weak coupling in dec-POMDPs

12 years 10 months ago

Download ai.eecs.umich.edu

Researchers in the ﬁeld of multiagent sequential decision making have commonly used the terms “weakly-coupled” and “loosely-coupled” to qualitatively classify problems i...

Stefan J. Witwicki, Edmund H. Durfee

claim paper

Read More »

click to vote

ICML
2008
IEEE

135views Machine Learning» more ICML 2008»

Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs

14 years 11 months ago

Download mapleleaf.csail.mit.edu

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...

Finale Doshi, Joelle Pineau, Nicholas Roy

claim paper

Read More »

click to vote

ICALP
2009
Springer

92views Programming Languages» more ICALP 2009»

Reachability in Stochastic Timed Games

14 years 10 months ago

Download www.lsv.ens-cachan.fr

We define stochastic timed games, which extend two-player timed games with probabilities (following a recent approach by Baier et al), and which extend in a natural way continuous-...

Patricia Bouyer, Vojtech Forejt

claim paper

Read More »

click to vote

ECML
2007
Springer

108views Machine Learning» more ECML 2007»

Safe Q-Learning on Complete History Spaces

14 years 4 months ago

Download www.ni.uos.de

In this article, we present an idea for solving deterministic partially observable markov decision processes (POMDPs) based on a history space containing sequences of past observat...

Stephan Timmer, Martin Riedmiller

claim paper

Read More »

click to vote

GECCO
2004
Springer

142views Optimization» more GECCO 2004»

Improving MACS Thanks to a Comparison with 2TBNs

14 years 3 months ago

Download www.cs.york.ac.uk

Abstract. Factored Markov Decision Processes is the theoretical framework underlying multi-step Learning Classiﬁer Systems research. This framework is mostly used in the context ...

Olivier Sigaud, Thierry Gourdin, Pierre-Henri Wuil...

claim paper

Read More »

« Prev « First page 57 / 137 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers