Search Sciweavers | Sciweavers

201 search results - page 8 / 41

» Solving Concurrent Markov Decision Processes

click to vote

ICRA
2007
IEEE

154views Robotics» more ICRA 2007»

Oracular Partially Observable Markov Decision Processes: A Very Special Case

14 years 1 months ago

Download www.cs.cmu.edu

— We introduce the Oracular Partially Observable Markov Decision Process (OPOMDP), a type of POMDP in which the world produces no observations; instead there is an “oracle,” ...

Nicholas Armstrong-Crews, Manuela M. Veloso

claim paper

Read More »

click to vote

IJCAI
2007

201views Artificial Intelligence» more IJCAI 2007»

Using Linear Programming for Bayesian Exploration in Markov Decision Processes

13 years 8 months ago

Download www.cs.mcgill.ca

A key problem in reinforcement learning is ﬁnding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

click to vote

UAI
1998

115views Artificial Intelligence» more UAI 1998»

Structured Reachability Analysis for Markov Decision Processes

13 years 8 months ago

Download www.cs.toronto.edu

Recent research in decision theoretic planning has focussedon making the solution of Markov decision processes (MDPs) more feasible. We develop a family of algorithms for structur...

Craig Boutilier, Ronen I. Brafman, Christopher W. ...

claim paper

Read More »

click to vote

ICTAI
2000
IEEE

186views Artificial Intelligence» more ICTAI 2000»

Building efficient partial plans using Markov decision processes

13 years 11 months ago

Download ccc.inaoep.mx

Markov Decision Processes (MDP) have been widely used as a framework for planning under uncertainty. They allow to compute optimal sequences of actions in order to achieve a given...

Pierre Laroche

claim paper

Read More »

click to vote

CDC
2009
IEEE

169views Control Systems» more CDC 2009»

Parametric regret in uncertain Markov decision processes

14 years 4 days ago

Download www.cim.mcgill.ca

— We consider decision making in a Markovian setup where the reward parameters are not known in advance. Our performance criterion is the gap between the performance of the best ...

Huan Xu, Shie Mannor

claim paper

Read More »

« Prev « First page 8 / 41 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers