Search Sciweavers | Sciweavers

185 search results - page 5 / 37

» Simulation-Based Optimization Algorithms for Finite-Horizon ...

click to vote

ECML
2005
Springer

143views Machine Learning» more ECML 2005»

Active Learning in Partially Observable Markov Decision Processes

14 years 1 months ago

Download www.cs.mcgill.ca

This paper examines the problem of ﬁnding an optimal policy for a Partially Observable Markov Decision Process (POMDP) when the model is not known or is only poorly speciﬁed. W...

Robin Jaulmes, Joelle Pineau, Doina Precup

claim paper

Read More »

click to vote

IJCAI
2007

201views Artificial Intelligence» more IJCAI 2007»

Using Linear Programming for Bayesian Exploration in Markov Decision Processes

13 years 9 months ago

Download www.cs.mcgill.ca

A key problem in reinforcement learning is ﬁnding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

click to vote

UAI
2000

168views Artificial Intelligence» more UAI 2000»

The Complexity of Decentralized Control of Markov Decision Processes

13 years 9 months ago

Download www.cs.umass.edu

We consider decentralized control of Markov decision processes and give complexity bounds on the worst-case running time for algorithms that find optimal solutions. Generalization...

Daniel S. Bernstein, Shlomo Zilberstein, Neil Imme...

claim paper

Read More »

click to vote

SODA
2010
ACM

190views Algorithms» more SODA 2010»

One-Counter Markov Decision Processes

14 years 5 months ago

Download www.fi.muni.cz

We study the computational complexity of some central analysis problems for One-Counter Markov Decision Processes (OC-MDPs), a class of finitely-presented, countable-state MDPs. O...

Tomas Brazdil, Vaclav Brozek, Kousha Etessami, Ant...

claim paper

Read More »

click to vote

FLAIRS
2008

108views Artificial Intelligence» more FLAIRS 2008»

A Novel Prioritization Technique for Solving Markov Decision Processes

13 years 10 months ago

Download damas.ift.ulaval.ca

We address the problem of computing an optimal value function for Markov decision processes. Since finding this function quickly and accurately requires substantial computation ef...

Jilles Steeve Dibangoye, Brahim Chaib-draa, Abdel-...

claim paper

Read More »

« Prev « First page 5 / 37 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers