Sciweavers

109 search results - page 13 / 22
» A temporal logic for Markov chains
Sort
View
ICML
2008
IEEE
14 years 9 months ago
A worst-case comparison between temporal difference and residual gradient with linear function approximation
Residual gradient (RG) was proposed as an alternative to TD(0) for policy evaluation when function approximation is used, but there exists little formal analysis comparing them ex...
Lihong Li
IJCAI
1989
13 years 9 months ago
A New Metaphor for the Graphical Explanation of Forward-Chaining Rule Execution
: This paper describes a novel method for displaying and examining the execution space of a rule interpreter. This method provides both coarse-grained and fine-grained views. The c...
John Domingue, Marc Eisenstadt
ICML
1999
IEEE
14 years 9 months ago
Least-Squares Temporal Difference Learning
Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...
Justin A. Boyan
ICA
2010
Springer
13 years 9 months ago
Non-negative Hidden Markov Modeling of Audio with Application to Source Separation
Abstract. In recent years, there has been a great deal of work in modeling audio using non-negative matrix factorization and its probabilistic counterparts as they yield rich model...
Gautham J. Mysore, Paris Smaragdis, Bhiksha Raj
FORMATS
2003
Springer
14 years 1 months ago
Discrete-Time Rewards Model-Checked
Abstract. This paper presents a model-checking approach for analyzing discrete-time Markov reward models. For this purpose, the temporal logic probabilistic CTL is extended with re...
Suzana Andova, Holger Hermanns, Joost-Pieter Katoe...