Sciweavers

350 search results - page 40 / 70
» Complexity of Planning with Partial Observability
ICANN
2007
Springer
14 years 2 months ago
Solving Deep Memory POMDPs with Recurrent Policy Gradients
Abstract. This paper presents Recurrent Policy Gradients, a model-free reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...
Daan Wierstra, Alexander Förster, Jan Peters,...
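The entry above concerns recurrent policies for memory-demanding POMDPs. As an illustrative sketch only (not the authors' exact algorithm), the following shows the general idea of a recurrent stochastic policy trained with a REINFORCE-style policy gradient on a toy memory task; the task, network sizes, and hyperparameters are assumptions made up for this example.

```python
# Sketch: recurrent stochastic policy + REINFORCE on a toy deep-memory task.
# Assumes PyTorch; all task details and hyperparameters are illustrative.
import torch
import torch.nn as nn

class RecurrentPolicy(nn.Module):
    def __init__(self, obs_dim=3, hidden=16, n_actions=2):
        super().__init__()
        self.rnn = nn.GRUCell(obs_dim, hidden)
        self.head = nn.Linear(hidden, n_actions)

    def forward(self, obs, h):
        h = self.rnn(obs, h)                      # carry memory across time steps
        dist = torch.distributions.Categorical(logits=self.head(h))
        return dist, h

def run_episode(policy, length=5, hidden=16):
    """Toy memory task: a cue shown only at t=0 must be recalled at the last step."""
    cue = torch.randint(0, 2, (1,)).item()
    h = torch.zeros(1, hidden)
    log_probs = []
    for t in range(length):
        obs = torch.zeros(1, 3)
        obs[0, cue if t == 0 else 2] = 1.0        # cue visible only at t=0
        dist, h = policy(obs, h)
        action = dist.sample()
        log_probs.append(dist.log_prob(action))
    reward = 1.0 if action.item() == cue else 0.0
    return torch.stack(log_probs).sum(), reward

policy = RecurrentPolicy()
opt = torch.optim.Adam(policy.parameters(), lr=1e-2)
for episode in range(2000):
    log_prob_sum, reward = run_episode(policy)
    loss = -log_prob_sum * reward                 # REINFORCE objective (no baseline)
    opt.zero_grad()
    loss.backward()
    opt.step()
```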
AINA
2007
IEEE
13 years 12 months ago
Domain Modelling for Ubiquitous Computing Applications
Many ubiquitous computing applications can be considered as planning and acting problems in environments characterised by uncertainty and partial observability. Such systems rely ...
Anthony Harrington, Vinny Cahill
AAAI
2012
11 years 10 months ago
Tree-Based Solution Methods for Multiagent POMDPs with Delayed Communication
Planning under uncertainty is an important and challenging problem in multiagent systems. Multiagent Partially Observable Markov Decision Processes (MPOMDPs) provide a powerful fr...
Frans Adriaan Oliehoek, Matthijs T. J. Spaan
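The entry above treats planning in multiagent POMDPs. As a hedged sketch (not the paper's tree-based method or its delayed-communication machinery), the following expands a finite-horizon tree of joint actions and joint observations for a tiny MPOMDP treated as a centralized POMDP; the transition, observation, and reward arrays are made-up placeholders.

```python
# Sketch: exhaustive finite-horizon tree search for a tiny centralized MPOMDP.
# All model parameters below are random placeholders for illustration.
import itertools
import numpy as np

n_states, agent_actions, agent_obs = 2, [2, 2], [2, 2]
joint_actions = list(itertools.product(*[range(a) for a in agent_actions]))
joint_obs = list(itertools.product(*[range(o) for o in agent_obs]))

rng = np.random.default_rng(0)
T = rng.dirichlet(np.ones(n_states), size=(len(joint_actions), n_states))       # T[a, s, s']
O = rng.dirichlet(np.ones(len(joint_obs)), size=(len(joint_actions), n_states)) # O[a, s', o]
R = rng.normal(size=(len(joint_actions), n_states))                             # R[a, s]

def value(belief, horizon):
    """Max expected return from `belief` with `horizon` joint decisions left."""
    if horizon == 0:
        return 0.0
    best = -np.inf
    for a in range(len(joint_actions)):
        q = belief @ R[a]                          # expected immediate reward
        for o in range(len(joint_obs)):
            next_b = (belief @ T[a]) * O[a][:, o]  # unnormalised successor belief
            p_o = next_b.sum()
            if p_o > 1e-12:
                q += p_o * value(next_b / p_o, horizon - 1)
        best = max(best, q)
    return best

b0 = np.array([0.5, 0.5])
print("3-step value of the uniform belief:", value(b0, 3))
```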
ICASSP
2008
IEEE
14 years 2 months ago
Bayesian update of dialogue state for robust dialogue systems
This paper presents a new framework for accumulating beliefs in spoken dialogue systems. The technique is based on updating a Bayesian Network that represents the underlying state...
Blaise Thomson, Jost Schatzmann, Steve Young
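The entry above describes accumulating beliefs over dialogue state by Bayesian updating. As a minimal sketch (not the paper's Bayesian network structure), the following applies one Bayes-rule step per user turn over a small set of user goals, given a noisy recognition hypothesis with a confidence score; the goal set and observation model are invented placeholders.

```python
# Sketch: discrete Bayesian belief update over user goals in a dialogue system.
# Goals, confusion model, and confidence handling are illustrative assumptions.
GOALS = ["cheap_hotel", "expensive_hotel", "cheap_restaurant"]

def update_belief(belief, hypothesis, confidence):
    """One Bayes step: P(goal | obs) is proportional to P(obs | goal) * P(goal)."""
    posterior = {}
    for goal in GOALS:
        # Toy observation model: the recognizer reports the true goal with
        # probability `confidence`, otherwise any other goal uniformly.
        if goal == hypothesis:
            likelihood = confidence
        else:
            likelihood = (1.0 - confidence) / (len(GOALS) - 1)
        posterior[goal] = likelihood * belief[goal]
    norm = sum(posterior.values())
    return {g: p / norm for g, p in posterior.items()}

belief = {g: 1.0 / len(GOALS) for g in GOALS}       # uniform prior
belief = update_belief(belief, "cheap_hotel", 0.7)   # first user turn
belief = update_belief(belief, "cheap_hotel", 0.6)   # corroborating second turn
print(max(belief, key=belief.get), belief)
```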
TACAS
2007
Springer
14 years 1 month ago
Model Checking on Trees with Path Equivalences
For specifying and verifying branching-time requirements, a reactive system is traditionally modeled as a labeled tree, where a path in the tree encodes a possible execution of the...
Rajeev Alur, Pavol Cerný, Swarat Chaudhuri