Search Sciweavers | Sciweavers

332 search results - page 11 / 67

» Ranking policies in discrete Markov decision processes

click to vote

UAI
1998

91views Artificial Intelligence» more UAI 1998»

Hierarchical Solution of Markov Decision Processes using Macro-actions

13 years 9 months ago

Download www.cs.toronto.edu

tigate the use of temporally abstract actions, or macro-actions, in the solution of Markov decision processes. Unlike current models that combine both primitive actions and macro-...

Milos Hauskrecht, Nicolas Meuleau, Leslie Pack Kae...

claim paper

Read More »

click to vote

ICML
2006
IEEE

156views Machine Learning» more ICML 2006»

Learning the structure of Factored Markov Decision Processes in reinforcement learning problems

14 years 8 months ago

Download animatlab.lip6.fr

Recent decision-theoric planning algorithms are able to find optimal solutions in large problems, using Factored Markov Decision Processes (fmdps). However, these algorithms need ...

Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille...

claim paper

Read More »

click to vote

FGR
2006
IEEE

121views Biometrics» more FGR 2006»

Learning to Identify Facial Expression During Detection Using Markov Decision Process

14 years 2 months ago

Download www.cs.rutgers.edu

While there has been a great deal of research in face detection and recognition, there has been very limited work on identifying the expression on a face. Many current face detect...

Ramana Isukapalli, Ahmed M. Elgammal, Russell Grei...

claim paper

Read More »

click to vote

ATAL
2004
Springer

132views Intelligent Agents» more ATAL 2004»

Decentralized Markov Decision Processes with Event-Driven Interactions

14 years 1 months ago

Download anytime.cs.umass.edu

Decentralized MDPs provide a powerful formal framework for planning in multi-agent systems, but the complexity of the model limits its usefulness. We study in this paper a class o...

Raphen Becker, Shlomo Zilberstein, Victor R. Lesse...

claim paper

Read More »

click to vote

CORR
2006
Springer

113views Education» more CORR 2006»

A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD

13 years 8 months ago

Download hal.inria.fr

This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(), LSTD()...

Manuel Loth, Philippe Preux

claim paper

Read More »

« Prev « First page 11 / 67 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers