Sciweavers

683 search results - page 70 / 137
Search query: Coarticulation in Markov Decision Processes
AIPS 2007
Prioritizing Bellman Backups without a Priority Queue
Several researchers have shown that the efficiency of value iteration, a dynamic programming algorithm for Markov decision processes, can be improved by prioritizing the order of...
Peng Dai, Eric A. Hansen
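The snippet above motivates ordering Bellman backups by how much they are expected to change the value function. As a rough illustration only, here is a prioritized-sweeping-style value iteration in Python that orders backups by Bellman error using an ordinary priority queue, i.e. the kind of baseline the paper's queue-free scheme is meant to improve on; the tabular MDP encoding (P, R) and all names are assumptions of this sketch, not the authors' code.

import heapq

# Sketch: tabular value iteration with backups ordered by Bellman error.
# P[s][a] is a list of (probability, next_state) pairs; R[s][a] is the reward.
def prioritized_value_iteration(P, R, gamma=0.95, eps=1e-6):
    n = len(P)
    V = [0.0] * n

    # Predecessor sets: backing up s can only change the error of states that reach s.
    preds = [set() for _ in range(n)]
    for s in range(n):
        for a in range(len(P[s])):
            for _, s2 in P[s][a]:
                preds[s2].add(s)

    def bellman(s):
        return max(R[s][a] + gamma * sum(p * V[s2] for p, s2 in P[s][a])
                   for a in range(len(P[s])))

    # Seed the queue with every state's Bellman error (negated: heapq is a min-heap).
    heap = [(-abs(bellman(s) - V[s]), s) for s in range(n)]
    heapq.heapify(heap)

    while heap:
        neg_err, s = heapq.heappop(heap)
        if -neg_err < eps:
            continue                      # stale or already-converged entry
        V[s] = bellman(s)
        for sp in preds[s]:               # re-prioritize affected predecessors
            err = abs(bellman(sp) - V[sp])
            if err > eps:
                heapq.heappush(heap, (-err, sp))
    return V

# Toy two-state chain: in state 0, action 0 stays, action 1 moves to state 1 for reward 1.
P = [[[(1.0, 0)], [(1.0, 1)]], [[(1.0, 1)]]]
R = [[0.0, 1.0], [0.0]]
print(prioritized_value_iteration(P, R))   # -> [1.0, 0.0]

The priority-queue bookkeeping is exactly the overhead the paper's title promises to avoid; the sketch is only meant to show what "prioritizing the order" of backups refers to.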
AIPS 2008
Multiagent Planning Under Uncertainty with Stochastic Communication Delays
We consider the problem of cooperative multiagent planning under uncertainty, formalized as a decentralized partially observable Markov decision process (Dec-POMDP). Unfortunately...
Matthijs T. J. Spaan, Frans A. Oliehoek, Nikos A. ...
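For readers who do not know the formalism named above: a Dec-POMDP is conventionally written as a tuple of agents, world states, per-agent action and observation sets, a joint transition function, a joint observation function, and a single shared reward. A minimal Python container for that standard tuple is sketched below; the field names are illustrative, and it does not model the stochastic communication delays that are this paper's actual subject.

from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

# Sketch of the standard Dec-POMDP tuple <I, S, {A_i}, T, R, {Omega_i}, O>;
# field names are illustrative, not any library's API.
@dataclass
class DecPOMDP:
    agents: List[str]                    # I: the agents
    states: List[str]                    # S: world states
    actions: Dict[str, List[str]]        # A_i: each agent's individual actions
    observations: Dict[str, List[str]]   # Omega_i: each agent's individual observations
    # T(s' | s, joint_action): distribution over next states
    transition: Callable[[str, Tuple[str, ...]], Dict[str, float]]
    # O(joint_observation | joint_action, s'): distribution over joint observations
    observe: Callable[[Tuple[str, ...], str], Dict[Tuple[str, ...], float]]
    # R(s, joint_action): one reward shared by all agents
    reward: Callable[[str, Tuple[str, ...]], float]

Planning then means choosing one policy per agent, each mapping that agent's local observation history to an action, so as to maximize the expected shared reward.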
ATAL 2008, Springer
Reinforcement learning for DEC-MDPs with changing action sets and partially ordered dependencies
Decentralized Markov decision processes are frequently used to model cooperative multi-agent systems. In this paper, we identify a subclass of general DEC-MDPs that features regul...
Thomas Gabel, Martin A. Riedmiller
AAAI 2006
Using Homomorphisms to Transfer Options across Continuous Reinforcement Learning Domains
We examine the problem of Transfer in Reinforcement Learning and present a method to utilize knowledge acquired in one Markov Decision Process (MDP) to bootstrap learning in a mor...
Vishal Soni, Satinder P. Singh
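The abstract is cut off before it describes the mechanism, but the title points to MDP homomorphisms: structure-preserving maps between the states and actions of two MDPs. Purely as a hedged sketch (hypothetical names, not the authors' method or code), the following shows how a source option's policy could be reused in a target MDP once a state map f and an action map g are available.

# Sketch: reuse a source-MDP option policy in a target MDP via assumed
# homomorphism maps. f(target_state) -> source_state and
# g(target_state, target_action) -> source_action. All names are illustrative.
def lift_option_policy(source_policy, f, g):
    def target_policy(state, available_actions):
        desired = source_policy(f(state))       # what the source option would do
        for a in available_actions:
            if g(state, a) == desired:          # pick a target action with that image
                return a
        return available_actions[0]             # fallback if nothing maps cleanly
    return target_policy

Transfer in this spirit can bootstrap learning in the new domain because the lifted option already encodes useful behaviour that learning only needs to refine.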
IJCAI 2003
Taming Decentralized POMDPs: Towards Efficient Policy Computation for Multiagent Settings
The problem of deriving joint policies for a group of agents that maximize some joint reward function can be modeled as a decentralized partially observable Markov decision proces...
Ranjit Nair, Milind Tambe, Makoto Yokoo, David V. ...