Search Sciweavers | Sciweavers

1176 search results - page 126 / 236

» Sparse reward processes

IROS
2006
IEEE

121 views · Robotics

Planning and Acting in Uncertain Environments using Probabilistic Inference

15 years 10 months ago

Download www.cs.washington.edu

An important problem in robotics is planning and selecting actions for goal-directed behavior in noisy uncertain environments. The problem is typically addressed within the fra...

Deepak Verma, Rajesh P. N. Rao

IAT
2005
IEEE

132 views · Intelligent Agents

Decomposing Large-Scale POMDP Via Belief State Analysis

15 years 10 months ago

Download www.comp.hkbu.edu.hk

The partially observable Markov decision process (POMDP) is commonly used to model a stochastic environment with unobservable states to support optimal decision making. Computing ...

Xin Li, William K. Cheung, Jiming Liu

AAAI
2008

144 views · Intelligent Agents

A Variance Analysis for POMDP Policy Evaluation

15 years 6 months ago

Download www.cs.mcgill.ca

Partially Observable Markov Decision Processes have been studied widely as a model for decision making under uncertainty, and a number of methods have been developed to find the s...

Mahdi Milani Fard, Joelle Pineau, Peng Sun

ATAL
2008
Springer

103 views · Intelligent Agents

The permutable POMDP: fast solutions to POMDPs for preference elicitation

15 years 6 months ago

Download mapleleaf.csail.mit.edu

The ability for an agent to reason under uncertainty is crucial for many planning applications, since an agent rarely has access to complete, error-free information about its envi...

Finale Doshi, Nicholas Roy

IJCAI
2003

142 views · Artificial Intelligence

Taming Decentralized POMDPs: Towards Efficient Policy Computation for Multiagent Settings

15 years 5 months ago

Download dli.iiit.ac.in

The problem of deriving joint policies for a group of agents that maximize some joint reward function can be modeled as a decentralized partially observable Markov decision proces...

Ranjit Nair, Milind Tambe, Makoto Yokoo, David V. ...
