Search Sciweavers | Sciweavers

87 search results - page 3 / 18

» A policy iteration algorithm for Markov decision processes s...

click to vote

CORR
2012
Springer

235views Education» more CORR 2012»

An Incremental Sampling-based Algorithm for Stochastic Optimal Control

12 years 3 months ago

Download www.mit.edu

Abstract— In this paper, we consider a class of continuoustime, continuous-space stochastic optimal control problems. Building upon recent advances in Markov chain approximation ...

Vu Anh Huynh, Sertac Karaman, Emilio Frazzoli

claim paper

Read More »

click to vote

IJCAI
2007

182views Artificial Intelligence» more IJCAI 2007»

A Fast Analytical Algorithm for Solving Markov Decision Processes with Real-Valued Resources

13 years 9 months ago

Download teamcore.usc.edu

Agents often have to construct plans that obey deadlines or, more generally, resource limits for real-valued resources whose consumption can only be characterized by probability d...

Janusz Marecki, Sven Koenig, Milind Tambe

claim paper

Read More »

click to vote

JMLR
2002

100views more JMLR 2002»

On the Convergence of Optimistic Policy Iteration

13 years 7 months ago

Download www.mit.edu

We consider a finite-state Markov decision problem and establish the convergence of a special case of optimistic policy iteration that involves Monte Carlo estimation of Q-values,...

John N. Tsitsiklis

claim paper

Read More »

click to vote

AAAI
1997

139views Intelligent Agents» more AAAI 1997»

Model Minimization in Markov Decision Processes

13 years 8 months ago

Download www.cs.brown.edu

Many stochastic planning problems can be represented using Markov Decision Processes (MDPs). A difficulty with using these MDP representations is that the common algorithms for so...

Thomas Dean, Robert Givan

claim paper

Read More »

click to vote

ECSQARU
2001
Springer

118views Automated Reasoning» more ECSQARU 2001»

Space-Progressive Value Iteration: An Anytime Algorithm for a Class of POMDPs

13 years 12 months ago

Download www.cs.ust.hk

Abstract. Finding optimal policies for general partially observable Markov decision processes (POMDPs) is computationally difﬁcult primarily due to the need to perform dynamic-pr...

Nevin Lianwen Zhang, Weihong Zhang

claim paper

Read More »

« Prev « First page 3 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers