Search Sciweavers | Sciweavers

87 search results - page 15 / 18

» A policy iteration algorithm for Markov decision processes s...

ICTAI
2010
IEEE

226 views, Artificial Intelligence

A Closer Look at MOMDPs

13 years 5 months ago

Download www.loria.fr

Abstract: The difficulties encountered in sequential decision-making problems under uncertainty are often linked to the large size of the state space. Exploiting the structure of th...

Mauricio Araya-López, Vincent Thomas, Olivi...

ICML
2004
IEEE

214 views, Machine Learning

Apprenticeship learning via inverse reinforcement learning

14 years 8 months ago

Download ai.stanford.edu

We consider learning in a Markov decision process where we are not explicitly given a reward function, but where instead we can observe an expert demonstrating the task that we wa...

Pieter Abbeel, Andrew Y. Ng

ATAL
2009
Springer

103 views, Intelligent Agents

Lossless clustering of histories in decentralized POMDPs

14 years 2 months ago

Download www.science.uva.nl

Decentralized partially observable Markov decision processes (Dec-POMDPs) constitute a generic and expressive framework for multiagent planning under uncertainty. However, plannin...

Frans A. Oliehoek, Shimon Whiteson, Matthijs T. J....

ROBOCUP
2007
Springer

99 views, Robotics

Instance-Based Action Models for Fast Action Planning

14 years 1 months ago

Download userweb.cs.utexas.edu

Abstract. Two main challenges of robot action planning in real domains are uncertain action effects and dynamic environments. In this paper, an instance-based action model is lear...

Mazda Ahmadi, Peter Stone

ATAL
2009
Springer

205 views, Intelligent Agents

Point-based incremental pruning heuristic for solving finite-horizon DEC-POMDPs

14 years 2 months ago

Download www.aamas-conference.org

Recent scaling up of decentralized partially observable Markov decision process (DEC-POMDP) solvers towards realistic applications is mainly due to approximate methods. Of this fa...

Jilles Steeve Dibangoye, Abdel-Illah Mouaddib, Bra...
