Sciweavers

771 search results - page 82 / 155
» Markov Decision Processes with Arbitrary Reward Processes
Sort
View
ATAL
2007
Springer
15 years 10 months ago
A globally optimal algorithm for TTD-MDPs
In this paper, we discuss the use of Targeted Trajectory Distribution Markov Decision Processes (TTD-MDPs)—a variant of MDPs in which the goal is to realize a specified distrib...
Sooraj Bhat, David L. Roberts, Mark J. Nelson, Cha...
ICC
2009
IEEE
151views Communications» more  ICC 2009»
15 years 1 months ago
Performance Evaluation of Multiple-Relay Cooperative ARQ Strategies for Mobile Networks
In Cooperative Automatic Repeat reQuest (C-ARQ) protocols, one or more nodes can act as relays, collaborating in the frame retransmission process between a sender and a destination...
Juan J. Alcaraz, Joan García-Haro
ICASSP
2008
IEEE
15 years 10 months ago
Bayesian update of dialogue state for robust dialogue systems
This paper presents a new framework for accumulating beliefs in spoken dialogue systems. The technique is based on updating a Bayesian Network that represents the underlying state...
Blaise Thomson, Jost Schatzmann, Steve Young
EXACT
2008
15 years 6 months ago
Integrating Probabilistic and Knowledge-Based Systems for Explanation Generation
An important requirement for intelligent assistants is to have an explanation generation mechanism, so that the trainee has a better understanding of the recommended actions and ca...
Francisco Elizalde, Luis Enrique Sucar, Julieta No...
CVPR
2012
IEEE
13 years 6 months ago
RALF: A reinforced active learning formulation for object class recognition
Active learning aims to reduce the amount of labels required for classification. The main difficulty is to find a good trade-off between exploration and exploitation of the lab...
Sandra Ebert, Mario Fritz, Bernt Schiele