Search Sciweavers | Sciweavers

26

ICML
2008
IEEE

165views Machine Learning» more ICML 2008»

A worst-case comparison between temporal difference and residual gradient with linear function approximation

14 years 9 months ago

Residual gradient (RG) was proposed as an alternative to TD(0) for policy evaluation when function approximation is used, but there exists little formal analysis comparing them ex...

Lihong Li

claim paper

Read More »

24

click to vote

IJCAI
1989

78views Artificial Intelligence» more IJCAI 1989»

A New Metaphor for the Graphical Explanation of Forward-Chaining Rule Execution

13 years 9 months ago

Download ijcai.org

: This paper describes a novel method for displaying and examining the execution space of a rule interpreter. This method provides both coarse-grained and fine-grained views. The c...

John Domingue, Marc Eisenstadt

claim paper

Read More »

43

click to vote

ICML
1999
IEEE

168views Machine Learning» more ICML 1999»

Least-Squares Temporal Difference Learning

14 years 9 months ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan

claim paper

Read More »

39

click to vote

ICA
2010
Springer

261views Signal Processing» more ICA 2010»

Non-negative Hidden Markov Modeling of Audio with Application to Source Separation

13 years 9 months ago

Download web.media.mit.edu

Abstract. In recent years, there has been a great deal of work in modeling audio using non-negative matrix factorization and its probabilistic counterparts as they yield rich model...

Gautham J. Mysore, Paris Smaragdis, Bhiksha Raj

claim paper

Read More »

25

click to vote

FORMATS
2003
Springer

115views Formal Methods» more FORMATS 2003»

Discrete-Time Rewards Model-Checked

14 years 1 months ago

Download eprints.eemcs.utwente.nl

Abstract. This paper presents a model-checking approach for analyzing discrete-time Markov reward models. For this purpose, the temporal logic probabilistic CTL is extended with re...

Suzana Andova, Holger Hermanns, Joost-Pieter Katoe...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers