Search Sciweavers | Sciweavers

169 search results - page 20 / 34

» Planning with Continuous Actions in Partially Observable Env...

click to vote

ATAL
2006
Springer

157views Intelligent Agents» more ATAL 2006»

Decentralized planning under uncertainty for teams of communicating agents

13 years 11 months ago

Download www.cs.cmu.edu

Decentralized partially observable Markov decision processes (DEC-POMDPs) form a general framework for planning for groups of cooperating agents that inhabit a stochastic and part...

Matthijs T. J. Spaan, Geoffrey J. Gordon, Nikos A....

claim paper

Read More »

click to vote

IJCAI
2003

108views Artificial Intelligence» more IJCAI 2003»

Logical Filtering

13 years 9 months ago

Download dli.iiit.ac.in

Filtering denotes any method whereby an agent updates its belief state—its knowledge of the state of the world—from a sequence of actions and observations. In logical filterin...

Eyal Amir, Stuart J. Russell

claim paper

Read More »

click to vote

ATAL
2007
Springer

127views Intelligent Agents» more ATAL 2007»

Real-time agent characterization and prediction

14 years 1 months ago

Download www.aamas-conference.org

Reasoning about agents that we observe in the world is challenging. Our available information is often limited to observations of the agent’s external behavior in the past and p...

H. Van Dyke Parunak, Sven Brueckner, Robert S. Mat...

claim paper

Read More »

click to vote

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

14 years 1 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

click to vote

IPPS
2009
IEEE

119views Distributed And Parallel Com...» more IPPS 2009»

Crash fault detection in celerating environments

14 years 2 months ago

Download srikanth.sastry.name

Failure detectors are a service that provides (approximate) information about process crashes in a distributed system. The well-known “eventually perfect” failure detector, 3P...

Srikanth Sastry, Scott M. Pike, Jennifer L. Welch

claim paper

Read More »

« Prev « First page 20 / 34 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers