We consider an MDP setting in which the reward function is allowed to change during each time step of play (possibly in an adversarial manner), yet the dynamics remain fixed. Simi...

Eyal Even-Dar, Sham M. Kakade, Yishay Mansour

PAMI
2007

Value-Directed Human Behavior Analysis from Video Using Partially Observable Markov Decision Processes

This paper presents a method for learning decision theoretic models of human behaviors from video data. Our system learns relationships between the movements of a person, the co...

Jesse Hoey, James J. Little

AUTOMATICA
2008

Exact finite approximations of average-cost countable Markov decision processes

For a countable-state Markov decision process we introduce an embedding which produces a finite-state Markov decision process. The finite-state embedded process has the same optim...

Arie Leizarowitz, Adam Shwartz

AAAI
1996

Computing Optimal Policies for Partially Observable Decision Processes Using Compact Representations

Partially-observable Markov decision processes provide a very general model for decision-theoretic planning problems, allowing the trade-offs between various courses of actions t...

Craig Boutilier, David Poole
