Search Sciweavers | Sciweavers

499 search results - page 41 / 100

» Model Minimization in Markov Decision Processes

ACL
2008

136 views, Computational Linguistics

Mixture Model POMDPs for Efficient Handling of Uncertainty in Dialogue Management

15 years 6 months ago

Download www.classic-project.org

In spoken dialogue systems, Partially Observable Markov Decision Processes (POMDPs) provide a formal framework for making dialogue management decisions under uncertainty, but effi...

James Henderson, Oliver Lemon

ATAL
2010
Springer

136 views, Intelligent Agents

Quasi deterministic POMDPs and DecPOMDPs

15 years 5 months ago

Download www.damas.ift.ulaval.ca

In this paper, we study a particular subclass of partially observable models, called quasi-deterministic partially observable Markov decision processes (QDET-POMDPs), characterize...

Camille Besse, Brahim Chaib-draa

ILP
2007
Springer

283 views, Automated Reasoning

Building Relational World Models for Reinforcement Learning

15 years 11 months ago

Download ftp.cs.wisc.edu

Many reinforcement learning domains are highly relational. While traditional temporal-difference methods can be applied to these domains, they are limited in their capaci...

Trevor Walker, Lisa Torrey, Jude W. Shavlik, Richa...

ECML
2007
Springer

170 views, Machine Learning

Sequence Labeling with Reinforcement Learning and Ranking Algorithms

15 years 6 months ago

Download nieme.lip6.fr

Many problems in areas such as Natural Language Processing, Information Retrieval, or Bioinformatics involve the generic task of sequence labeling. In many cases, the aim is to assi...

Francis Maes, Ludovic Denoyer, Patrick Gallinari

ICML
1995
IEEE

213 views, Machine Learning

Learning Policies for Partially Observable Environments: Scaling Up

16 years 5 months ago

Download reference.kfupm.edu.sa

Partially observable Markov decision processes (pomdp's) model decision problems in which an agent tries to maximize its reward in the face of limited and/or noisy sensor fee...

Michael L. Littman, Anthony R. Cassandra, Leslie P...
