Sciweavers

» Experts in a Markov Decision Process
GLOBECOM
2006
IEEE
14 years 3 months ago
Dynamic Wavelength Sharing Policies for Absolute QoS in OBS Networks
We consider the problem of providing absolute QoS guarantees to multiple classes of users of an OBS network in terms of the end-to-end burst loss. We employ Markov decision pro...
Li Yang, George N. Rouskas
ICTAI
2006
IEEE
14 years 3 months ago
A New Hybrid GA-MDP Algorithm For The Frequency Assignment Problem
We propose a novel algorithm called GA-MDP for solving the frequency assignment problem. GA-MDP inherits the spirit of genetic algorithms with an adaptation of Markov Decision Proc...
Lhassane Idoumghar, René Schott
NAACL
2007
13 years 10 months ago
Comparing User Simulation Models For Dialog Strategy Learning
This paper explores what kind of user simulation model is suitable for developing a training corpus for using Markov Decision Processes (MDPs) to automatically learn dialog strate...
Hua Ai, Joel R. Tetreault, Diane J. Litman
NIPS
2007
13 years 10 months ago
Online Linear Regression and Its Application to Model-Based Reinforcement Learning
We provide a provably efficient algorithm for learning Markov Decision Processes (MDPs) with continuous state and action spaces in the online setting. Specifically, we take a mo...
Alexander L. Strehl, Michael L. Littman
UAI
2000
13 years 10 months ago
Approximately Optimal Monitoring of Plan Preconditions
Monitoring plan preconditions can allow for replanning when a precondition fails, generally far in advance of the point in the plan where the precondition is relevant. However, mo...
Craig Boutilier