Search Sciweavers | Sciweavers

337 search results - page 30 / 68

» Mean-Variance Optimization in Markov Decision Processes

155

Voted

IJCAI
2001

185views Artificial Intelligence» more IJCAI 2001»

Symbolic Dynamic Programming for First-Order MDPs

15 years 6 months ago

Download www.cs.toronto.edu

We present a dynamic programming approach for the solution of first-order Markov decisions processes. This technique uses an MDP whose dynamics is represented in a variant of the ...

Craig Boutilier, Raymond Reiter, Bob Price

claim paper

Read More »

167

click to vote

AAAI
1996

119views Intelligent Agents» more AAAI 1996»

Rewarding Behaviors

15 years 6 months ago

Download www.cs.toronto.edu

Markov decision processes (MDPs) are a very popular tool for decision theoretic planning (DTP), partly because of the welldeveloped, expressive theory that includes effective solu...

Fahiem Bacchus, Craig Boutilier, Adam J. Grove

claim paper

Read More »

138

click to vote

MOR
2008

87views more MOR 2008»

On Near Optimality of the Set of Finite-State Controllers for Average Cost POMDP

15 years 5 months ago

Download www.cs.helsinki.fi

We consider the average cost problem for partially observable Markov decision processes (POMDP) with finite state, observation, and control spaces. We prove that there exists an -...

Huizhen Yu, Dimitri P. Bertsekas

claim paper

Read More »

137

Voted

AAAI
2007

117views Intelligent Agents» more AAAI 2007»

Optimizing Anthrax Outbreak Detection Using Reinforcement Learning

15 years 7 months ago

Download www.aaai.org

The potentially catastrophic impact of a bioterrorist attack makes developing effective detection methods essential for public health. In the case of anthrax attack, a delay of ho...

Masoumeh T. Izadi, David L. Buckeridge

claim paper

Read More »

154

Voted

AMAI
2006
Springer

123views Artificial Intelligence» more AMAI 2006»

Symmetric approximate linear programming for factored MDPs with application to constrained problems

15 years 5 months ago

Download ai.stanford.edu

A weakness of classical Markov decision processes (MDPs) is that they scale very poorly due to the flat state-space representation. Factored MDPs address this representational pro...

Dmitri A. Dolgov, Edmund H. Durfee

claim paper

Read More »

« Prev « First page 30 / 68 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers