Search Sciweavers | Sciweavers

802 search results - page 14 / 161

» Experts in a Markov Decision Process

162

click to vote

IJCAI
2007

201views Artificial Intelligence» more IJCAI 2007»

Using Linear Programming for Bayesian Exploration in Markov Decision Processes

15 years 7 months ago

Download www.cs.mcgill.ca

A key problem in reinforcement learning is ﬁnding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

159

click to vote

AAAI
1998

129views Intelligent Agents» more AAAI 1998»

Solving Very Large Weakly Coupled Markov Decision Processes

15 years 7 months ago

Download www.cs.toronto.edu

We present a technique for computing approximately optimal solutions to stochastic resource allocation problems modeled as Markov decision processes (MDPs). We exploit two key pro...

Nicolas Meuleau, Milos Hauskrecht, Kee-Eung Kim, L...

claim paper

Read More »

159

click to vote

ATAL
2007
Springer

185views Intelligent Agents» more ATAL 2007»

On opportunistic techniques for solving decentralized Markov decision processes with temporal constraints

15 years 11 months ago

Download www.aamas-conference.org

Decentralized Markov Decision Processes (DEC-MDPs) are a popular model of agent-coordination problems in domains with uncertainty and time constraints but very difﬁcult to solve...

Janusz Marecki, Milind Tambe

claim paper

Read More »

140

click to vote

DSN
2006
IEEE

151views Computer Networks» more DSN 2006»

Automatic Recovery Using Bounded Partially Observable Markov Decision Processes

15 years 11 months ago

Download www.perform.csl.illinois.edu

This paper provides a technique, based on partially observable Markov decision processes (POMDPs), for building automatic recovery controllers to guide distributed system recovery...

Kaustubh R. Joshi, William H. Sanders, Matti A. Hi...

claim paper

Read More »

153

click to vote

ICC
2007
IEEE

137views Communications» more ICC 2007»

Optimality and Complexity of Opportunistic Spectrum Access: A Truncated Markov Decision Process Formulation

15 years 12 months ago

Download www.ece.ucdavis.edu

— We consider opportunistic spectrum access (OSA) which allows secondary users to identify and exploit instantaneous spectrum opportunities resulting from the bursty trafﬁc of ...

Dejan V. Djonin, Qing Zhao, Vikram Krishnamurthy

claim paper

Read More »

« Prev « First page 14 / 161 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers