Sciweavers

98 search results - page 7 / 20
» Using Rewards for Belief State Updates in Partially Observab...
AAAI
2010
Compressing POMDPs Using Locality Preserving Non-Negative Matrix Factorization
Partially Observable Markov Decision Processes (POMDPs) are a well-established and rigorous framework for sequential decision-making under uncertainty. POMDPs are well-known to be...
Georgios Theocharous, Sridhar Mahadevan
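The POMDP framework this abstract builds on maintains a belief state, i.e. a probability distribution over hidden states, updated by Bayes' rule after each action and observation. The following is a minimal illustrative sketch of that standard belief update on an invented two-state toy problem; it is not the paper's compression method, and all state, action, and observation names and probabilities are made up.

```python
# Hypothetical sketch of a discrete POMDP belief-state update (Bayes filter).
# All model parameters below are invented for illustration.

def belief_update(belief, action, obs, T, O):
    """One Bayes-filter step: b'(s') ∝ O[a][s'][o] * sum_s T[a][s][s'] * b(s)."""
    states = list(belief)
    new_b = {}
    for s2 in states:
        # Predict: push the belief through the transition model.
        pred = sum(T[action][s][s2] * belief[s] for s in states)
        # Correct: weight by the likelihood of the received observation.
        new_b[s2] = O[action][s2][obs] * pred
    norm = sum(new_b.values())
    return {s: p / norm for s, p in new_b.items()}

# Two-state toy model (made-up numbers): T[a][s][s'] and O[a][s'][o].
T = {"a": {"s0": {"s0": 0.9, "s1": 0.1}, "s1": {"s0": 0.2, "s1": 0.8}}}
O = {"a": {"s0": {"o0": 0.8, "o1": 0.2}, "s1": {"o0": 0.3, "o1": 0.7}}}

b = belief_update({"s0": 0.5, "s1": 0.5}, "a", "o0", T, O)
```

After observing "o0", which is more likely under "s0", the updated belief shifts toward "s0" while remaining a normalized distribution.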
AAAI
2000
Back to the Future for Consistency-Based Trajectory Tracking
Given a model of a physical process and a sequence of commands and observations received over time, the task of an autonomous controller is to determine the likely states of the p...
James Kurien, P. Pandurang Nayak
ICML
2004
Utile distinction hidden Markov models
This paper addresses the problem of constructing good action selection policies for agents acting in partially observable environments, a class of problems generally known as Part...
Daan Wierstra, Marco Wiering
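Hidden Markov models, the model family this abstract extends, score an observation sequence by marginalizing over hidden-state paths with the forward algorithm. Below is a minimal illustrative sketch of that standard forward recursion, not the paper's utile-distinction method; the two-state parameters are invented.

```python
# Hypothetical forward-algorithm sketch for a discrete HMM.
# pi: initial distribution, A: transitions, B: emission probabilities.
# All parameters are made up for illustration.

def forward(obs_seq, pi, A, B):
    """Return P(obs_seq) by summing over all hidden-state paths."""
    # Initialize with the first observation's emission likelihoods.
    alpha = {s: pi[s] * B[s][obs_seq[0]] for s in pi}
    for o in obs_seq[1:]:
        # Propagate forward probabilities one step, then weight by emission.
        alpha = {s2: B[s2][o] * sum(alpha[s] * A[s][s2] for s in alpha)
                 for s2 in alpha}
    return sum(alpha.values())

pi = {"h0": 0.6, "h1": 0.4}
A = {"h0": {"h0": 0.7, "h1": 0.3}, "h1": {"h0": 0.4, "h1": 0.6}}
B = {"h0": {"x": 0.5, "y": 0.5}, "h1": {"x": 0.1, "y": 0.9}}

p = forward(["x", "y"], pi, A, B)
```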
CDC
2008
IEEE
Information state for Markov decision processes with network delays
We consider a networked control system, where each subsystem evolves as a Markov decision process (MDP). Each subsystem is coupled to its neighbors via communication links over wh...
Sachin Adlakha, Sanjay Lall, Andrea J. Goldsmith
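Each subsystem in this abstract evolves as a Markov decision process, which is solved by dynamic programming once an information state is identified. As a minimal illustrative sketch of the underlying MDP machinery (not the paper's delayed-network construction), here is standard value iteration on an invented two-state chain.

```python
# Hypothetical value-iteration sketch for a finite MDP.
# T[s][a][s'] is the transition probability, R[s][a] the immediate reward.
# The two-state model below is invented for illustration.

def value_iteration(states, actions, T, R, gamma=0.9, eps=1e-6):
    """Iterate the Bellman optimality operator to a fixed point."""
    V = {s: 0.0 for s in states}
    while True:
        V2 = {s: max(R[s][a] + gamma * sum(T[s][a][s2] * V[s2] for s2 in states)
                     for a in actions)
              for s in states}
        if max(abs(V2[s] - V[s]) for s in states) < eps:
            return V2
        V = V2

states, actions = ["s0", "s1"], ["stay", "go"]
T = {"s0": {"stay": {"s0": 1.0, "s1": 0.0}, "go": {"s0": 0.0, "s1": 1.0}},
     "s1": {"stay": {"s0": 0.0, "s1": 1.0}, "go": {"s0": 1.0, "s1": 0.0}}}
R = {"s0": {"stay": 0.0, "go": 1.0}, "s1": {"stay": 1.0, "go": 0.0}}

V = value_iteration(states, actions, T, R)
```

With reward 1 available on every step from either state via the better action, both values converge toward 1/(1-gamma) = 10.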
HICSS
2003
IEEE
Issues in Rational Planning in Multi-Agent Settings
We adopt the decision-theoretic principle of expected utility maximization as a paradigm for designing autonomous rational agents operating in multi-agent environments. We use the...
Piotr J. Gmytrasiewicz