observable markov decision

AAAI
2012

191 views, Intelligent Agents

Tree-Based Solution Methods for Multiagent POMDPs with Delayed Communication

13 years 4 months ago

Planning under uncertainty is an important and challenging problem in multiagent systems. Multiagent Partially Observable Markov Decision Processes (MPOMDPs) provide a powerful fr...

Frans Adriaan Oliehoek, Matthijs T. J. Spaan

AAAI
2012

215 views, Intelligent Agents

POMDPs Make Better Hackers: Accounting for Uncertainty in Penetration Testing

13 years 4 months ago

Download corelabs.coresecurity.com

Penetration Testing is a methodology for assessing network security, by generating and executing possible hacking attacks. Doing so automatically allows for regular and systematic...

Carlos Sarraute, Olivier Buffet, Jörg Hoffman...

CSL
2012
Springer

311 views, Automated Reasoning

Reinforcement learning for parameter estimation in statistical spoken dialogue systems

13 years 9 months ago

Download mi.eng.cam.ac.uk

Reinforcement techniques have been successfully used to maximise the expected cumulative reward of statistical dialogue systems. Typically, reinforcement learning is used to estim...

Filip Jurčíček, Blaise Thomson, Steve Young

AAAI
2011

246 views, Intelligent Agents

An Online Spectral Learning Algorithm for Partially Observable Nonlinear Dynamical Systems

14 years 2 months ago

Download www.cs.cmu.edu

Recently, a number of researchers have proposed spectral algorithms for learning models of dynamical systems—for example, Hidden Markov Models (HMMs), Partially Observable Marko...

Byron Boots, Geoffrey J. Gordon

AIED
2011
Springer

243 views, Artificial Intelligence

Faster Teaching by POMDP Planning

14 years 5 months ago

Download louisville.edu

Both human and automated tutors must infer what a student knows and plan future actions to maximize learning. Though substantial research has been done on tracking and modeling stu...

Anna N. Rafferty, Emma Brunskill, Thomas L. Griffi...

COLING
2010

138 views, Computational Linguistics

Controlling Listening-oriented Dialogue using Partially Observable Markov Decision Processes

14 years 9 months ago

Download aclweb.org

This paper investigates how to automatically create a dialogue control component of a listening agent to reduce the current high cost of manually creating such components. We coll...

Toyomi Meguro, Ryuichiro Higashinaka, Yasuhiro Min...

PKDD
2010
Springer

164 views, Data Mining

Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations

15 years 13 hours ago

Download users.ics.tkk.fi

Partially observable Markov decision processes (POMDPs) are widely used for planning under uncertainty. In many applications, the huge size of the POMDP state space makes straightf...

Joni Pajarinen, Jaakko Peltonen, Ari Hottinen, Mik...

ICTAI
2010
IEEE

226 views, Artificial Intelligence

A Closer Look at MOMDPs

15 years 2 days ago

Download www.loria.fr

The difficulties encountered in sequential decision-making problems under uncertainty are often linked to the large size of the state space. Exploiting the structure of th...

Mauricio Araya-López, Vincent Thomas, Olivi...

NN
2010
Springer

125 views, Neural Networks

Parameter-exploring policy gradients

15 years 16 days ago

Download www.kyb.mpg.de

We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...

Frank Sehnke, Christian Osendorfer, Thomas Rü...

IJRR
2010

162 views

Planning under Uncertainty for Robotic Tasks with Mixed Observability

15 years 18 days ago

Download motion.comp.nus.edu.sg

Partially observable Markov decision processes (POMDPs) provide a principled, general framework for robot motion planning in uncertain and dynamic environments. They have been app...

Sylvie C. W. Ong, Shao Wei Png, David Hsu, Wee Sun...
