Search Sciweavers | Sciweavers

260 search results - page 39 / 52

ATAL
2004
Springer

Intelligent Agents

Communication for Improving Policy Computation in Distributed POMDPs

Download teamcore.usc.edu

Distributed Partially Observable Markov Decision Problems (POMDPs) are emerging as a popular approach for modeling multiagent teamwork where a group of agents work together to joi...

Ranjit Nair, Milind Tambe, Maayan Roth, Makoto Yok...

NN
2010
Springer

Neural Networks

Parameter-exploring policy gradients

Download www.kyb.mpg.de

We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...

Frank Sehnke, Christian Osendorfer, Thomas Rü...

GECCO
2008
Springer

Optimization

Emergent architecture in self organized swarm systems for military applications

Download www.cs.bham.ac.uk

Many sectors of the military are interested in Self-Organized (SO) systems because of their flexibility, versatility and economics. The military is researching and employing auto...

Dustin J. Nowak, Gary B. Lamont, Gilbert L. Peters...

AAAI
2007

Intelligent Agents

Optimizing Anthrax Outbreak Detection Using Reinforcement Learning

Download www.aaai.org

The potentially catastrophic impact of a bioterrorist attack makes developing effective detection methods essential for public health. In the case of anthrax attack, a delay of ho...

Masoumeh T. Izadi, David L. Buckeridge

IJCAI
2001

Artificial Intelligence

Complexity of Probabilistic Planning under Average Rewards

Download www.informatik.uni-freiburg.de

A general and expressive model of sequential decision making under uncertainty is provided by the Markov decision processes (MDPs) framework. Complex applications with very large ...

Jussi Rintanen
