Search Sciweavers | Sciweavers

» Bounded Parameter Markov Decision Processes

TACAS
2007
Springer

Multi-objective Model Checking of Markov Decision Processes

We study and provide efficient algorithms for multi-objective model checking problems for Markov Decision Processes (MDPs). Given an MDP, M, and given multiple linear-time (ω-regu...

Kousha Etessami, Marta Z. Kwiatkowska, Moshe Y. Va...

PAMI
2007

Value-Directed Human Behavior Analysis from Video Using Partially Observable Markov Decision Processes

This paper presents a method for learning decision theoretic models of human behaviors from video data. Our system learns relationships between the movements of a person, the co...

Jesse Hoey, James J. Little

ATAL
2008
Springer

Controlling deliberation in a Markov decision process-based agent

Meta-level control manages the allocation of limited resources to deliberative actions. This paper discusses efforts in adding meta-level control capabilities to a Markov Decision...

George Alexander, Anita Raja, David J. Musliner

NIPS
2004

A Cost-Shaping LP for Bellman Error Minimization with Performance Guarantees

We introduce a new algorithm based on linear programming that approximates the differential value function of an average-cost Markov decision process via a linear combination of p...

Daniela Pucci de Farias, Benjamin Van Roy

COLT
2000
Springer

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process (POMDP), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter
