Sciweavers

802 search results - page 95 / 161
» Experts in a Markov Decision Process
AIPS
2000
13 years 11 months ago
On-line Scheduling via Sampling
We consider the problem of scheduling an unknown sequence of tasks for a single server as the tasks arrive, with the goal of maximizing the total weighted value of the tasks serv...
Hyeong Soo Chang, Robert Givan, Edwin K. P. Chong
CORR
2006
Springer
13 years 10 months ago
A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD
This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(λ), LSTD(λ)...
Manuel Loth, Philippe Preux
CSL
2012
Springer
12 years 5 months ago
Reinforcement learning for parameter estimation in statistical spoken dialogue systems
Reinforcement techniques have been successfully used to maximise the expected cumulative reward of statistical dialogue systems. Typically, reinforcement learning is used to estim...
Filip Jurcícek, Blaise Thomson, Steve Young
ICIP
2003
IEEE
14 years 11 months ago
Pixel classification through divergence-based integration of texture methods with conflict resolution
This paper presents a new technique for combining multiple texture feature extraction methods in order to classify the pixels of an input image into a set of texture models of int...
Domènec Puig, Miguel Angel García
WOA
2003
13 years 11 months ago
A Design Tool to Develop Agent-Based Workflow Management Systems
This paper describes a methodology to design a workflow management system where a set of intelligent software agents composes an interactive scenario. The Workflow Management C...
Marco Repetto, Massimo Paolucci 0002, Antonio Bocc...