Sciweavers

802 search results (page 67 of 161) for "Experts in a Markov Decision Process"
ICML 2008 · ACM
Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs
Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...
Finale Doshi, Joelle Pineau, Nicholas Roy
ICALP 2009 · Springer
Reachability in Stochastic Timed Games
We define stochastic timed games, which extend two-player timed games with probabilities (following a recent approach by Baier et al.), and which extend in a natural way continuous-...
Patricia Bouyer, Vojtech Forejt
ECML 2007 · Springer
Safe Q-Learning on Complete History Spaces
In this article, we present an idea for solving deterministic partially observable Markov decision processes (POMDPs) based on a history space containing sequences of past observat...
Stephan Timmer, Martin Riedmiller
GECCO 2004 · Springer
Improving MACS Thanks to a Comparison with 2TBNs
Factored Markov Decision Processes are the theoretical framework underlying multi-step Learning Classifier Systems research. This framework is mostly used in the context ...
Olivier Sigaud, Thierry Gourdin, Pierre-Henri Wuil...
ECSQARU 2001 · Springer
Space-Progressive Value Iteration: An Anytime Algorithm for a Class of POMDPs
Finding optimal policies for general partially observable Markov decision processes (POMDPs) is computationally difficult primarily due to the need to perform dynamic-pr...
Nevin Lianwen Zhang, Weihong Zhang