Search Sciweavers | Sciweavers

91 search results - page 10 / 19

» Percentile Optimization for Markov Decision Processes with P...

click to vote

ECCV
2004
Springer

361views Computer Vision» more ECCV 2004»

Decision Theoretic Modeling of Human Facial Displays

14 years 9 months ago

Download people.cs.ubc.ca

We present a vision based, adaptive, decision theoretic model of human facial displays in interactions. The model is a partially observable Markov decision process, or POMDP. A POM...

Jesse Hoey, James J. Little

claim paper

Read More »

click to vote

IJCAI
2003

123views Artificial Intelligence» more IJCAI 2003»

Automated Generation of Understandable Contingency Plans

13 years 9 months ago

Download anytime.cs.umass.edu

Markov decision processes (MDPs) and contingency planning (CP) are two widely used approaches to planning under uncertainty. MDPs are attractive because the model is extremely gen...

Max Horstmann, Shlomo Zilberstein

claim paper

Read More »

click to vote

JMLR
2006

143views more JMLR 2006»

Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation

13 years 7 months ago

Download www.aaai.org

We study a sequential variance reduction technique for Monte Carlo estimation of functionals in Markov Chains. The method is based on designing sequential control variates using s...

Rémi Munos

claim paper

Read More »

click to vote

AIPS
2009

144views Artificial Intelligence» more AIPS 2009»

Efficient Solutions to Factored MDPs with Imprecise Transition Probabilities

13 years 8 months ago

Download www.ime.usp.br

When modeling real-world decision-theoretic planning problems in the Markov decision process (MDP) framework, it is often impossible to obtain a completely accurate estimate of tr...

Karina Valdivia Delgado, Scott Sanner, Leliane Nun...

claim paper

Read More »

click to vote

JMLR
2010

125views more JMLR 2010»

Variational methods for Reinforcement Learning

13 years 2 months ago

Download jmlr.csail.mit.edu

We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transit...

Thomas Furmston, David Barber

claim paper

Read More »

« Prev « First page 10 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers