Search Sciweavers | Sciweavers

1138 search results - page 13 / 228

» Feature Markov Decision Processes

199

click to vote

IJCAI
2007

201views Artificial Intelligence» more IJCAI 2007»

Using Linear Programming for Bayesian Exploration in Markov Decision Processes

15 years 8 months ago

Download www.cs.mcgill.ca

A key problem in reinforcement learning is ﬁnding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

192

click to vote

AAAI
1998

129views Intelligent Agents» more AAAI 1998»

Solving Very Large Weakly Coupled Markov Decision Processes

15 years 8 months ago

Download www.cs.toronto.edu

We present a technique for computing approximately optimal solutions to stochastic resource allocation problems modeled as Markov decision processes (MDPs). We exploit two key pro...

Nicolas Meuleau, Milos Hauskrecht, Kee-Eung Kim, L...

claim paper

Read More »

187

click to vote

ATAL
2007
Springer

185views Intelligent Agents» more ATAL 2007»

On opportunistic techniques for solving decentralized Markov decision processes with temporal constraints

16 years 1 months ago

Download www.aamas-conference.org

Decentralized Markov Decision Processes (DEC-MDPs) are a popular model of agent-coordination problems in domains with uncertainty and time constraints but very difﬁcult to solve...

Janusz Marecki, Milind Tambe

claim paper

Read More »

263

click to vote

PAMI
2007

186views more PAMI 2007»

Value-Directed Human Behavior Analysis from Video Using Partially Observable Markov Decision Processes

15 years 6 months ago

Download people.ee.duke.edu

—This paper presents a method for learning decision theoretic models of human behaviors from video data. Our system learns relationships between the movements of a person, the co...

Jesse Hoey, James J. Little

claim paper

Read More »

159

click to vote

DSN
2006
IEEE

151views Computer Networks» more DSN 2006»

Automatic Recovery Using Bounded Partially Observable Markov Decision Processes

16 years 1 months ago

Download www.perform.csl.illinois.edu

This paper provides a technique, based on partially observable Markov decision processes (POMDPs), for building automatic recovery controllers to guide distributed system recovery...

Kaustubh R. Joshi, William H. Sanders, Matti A. Hi...

claim paper

Read More »

« Prev « First page 13 / 228 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers