Sciweavers

84 search results - page 3 / 17
» Asynchronous Distributed Learning of Topic Models
Sort
View
ICML
2009
IEEE
14 years 9 months ago
Incorporating domain knowledge into topic modeling via Dirichlet Forest priors
Users of topic modeling methods often have knowledge about the composition of words that should have high or low probability in various topics. We incorporate such domain knowledg...
David Andrzejewski, Xiaojin Zhu, Mark Craven
EMNLP
2008
13 years 10 months ago
HTM: A Topic Model for Hypertexts
Previously topic models such as PLSI (Probabilistic Latent Semantic Indexing) and LDA (Latent Dirichlet Allocation) were developed for modeling the contents of plain texts. Recent...
Congkai Sun, Bin Gao, Zhenfu Cao, Hang Li
DEXAW
2010
IEEE
202views Database» more  DEXAW 2010»
13 years 9 months ago
Identifying Sentence-Level Semantic Content Units with Topic Models
Abstract--Statistical approaches to document content modeling typically focus either on broad topics or on discourselevel subtopics of a text. We present an analysis of the perform...
Leonhard Hennig, Thomas Strecker, Sascha Narr, Ern...
KDD
2007
ACM
206views Data Mining» more  KDD 2007»
14 years 9 months ago
Automatic labeling of multinomial topic models
Multinomial distributions over words are frequently used to model topics in text collections. A common, major challenge in applying all such topic models to any text mining proble...
Qiaozhu Mei, Xuehua Shen, ChengXiang Zhai
ICGI
2010
Springer
13 years 7 months ago
Learning PDFA with Asynchronous Transitions
In this paper we extend the PAC learning algorithm due to Clark and Thollard for learning distributions generated by PDFA to automata whose transitions may take varying time length...
Borja Balle, Jorge Castro, Ricard Gavaldà