Search Sciweavers | Sciweavers

87 search results - page 5 / 18

» A policy iteration algorithm for Markov decision processes s...

click to vote

ALDT
2009
Springer

142views Algorithms» more ALDT 2009»

Finding Best k Policies

14 years 2 months ago

Download www.cs.uky.edu

Abstract. An optimal probabilistic-planning algorithm solves a problem, usually modeled by a Markov decision process, by ﬁnding its optimal policy. In this paper, we study the k ...

Peng Dai, Judy Goldsmith

claim paper

Read More »

click to vote

PKDD
2010
Springer

164views Data Mining» more PKDD 2010»

Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations

13 years 5 months ago

Download users.ics.tkk.fi

Partially observable Markov decision processes (POMDPs) are widely used for planning under uncertainty. In many applications, the huge size of the POMDP state space makes straightf...

Joni Pajarinen, Jaakko Peltonen, Ari Hottinen, Mik...

claim paper

Read More »

click to vote

CORR
2012
Springer

286views Education» more CORR 2012»

A Faster Algorithm for Solving One-Clock Priced Timed Games

12 years 3 months ago

Download www.daimi.au.dk

One-clock priced timed games is a class of two-player, zero-sum, continuous-time games that was deﬁned and thoroughly studied in previous works. We show that One-clock priced ti...

Thomas Dueholm Hansen, Rasmus Ibsen-Jensen, Peter ...

claim paper

Read More »

click to vote

CAV
2007
Springer

112views Hardware» more CAV 2007»

Magnifying-Lens Abstraction for Markov Decision Processes

14 years 1 months ago

Download www.ee.ucla.edu

ng-Lens Abstraction for Markov Decision Processes⋆ In Proc. of CAV 2007: 19th International Conference on Computer-Aided Veriﬁcation, Lectures Notes in Computer Science. c Spri...

Luca de Alfaro, Pritam Roy

claim paper

Read More »

click to vote

AAAI
2006

146views Intelligent Agents» more AAAI 2006»

Incremental Least Squares Policy Iteration for POMDPs

13 years 9 months ago

Download www.aaai.org

We present a new algorithm, called incremental least squares policy iteration (ILSPI), for finding the infinite-horizon stationary policy for partially observable Markov decision ...

Hui Li, Xuejun Liao, Lawrence Carin

claim paper

Read More »

« Prev « First page 5 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers