Sciweavers

374 search results - page 8 / 75
» Modeling Chinese Documents with Topical Word-Character Model...
EWMF
2005
Springer
15 years 9 months ago
Semi-automatic Construction of Topic Ontologies
In this paper, we review two techniques for topic discovery in collections of text documents (Latent Semantic Indexing and K-Means clustering) and present how we integrated them in...
Blaz Fortuna, Dunja Mladenic, Marko Grobelnik
ACML
2009
Springer
15 years 10 months ago
Estimating Likelihoods for Topic Models
Topic models are a discrete analogue to principal component analysis and independent component analysis that model topic at the word level within a document. They have ma...
Wray L. Buntine
ICML
2009
IEEE
15 years 10 months ago
Topic-link LDA: joint models of topic and author community
Given a large-scale linked document collection, such as a collection of blog posts or a research literature archive, there are two fundamental problems that have generated a lot o...
Yan Liu, Alexandru Niculescu-Mizil, Wojciech Gryc
ICDM
2007
IEEE
15 years 10 months ago
Bayesian Folding-In with Dirichlet Kernels for PLSI
Probabilistic latent semantic indexing (PLSI) represents documents of a collection as mixture proportions of latent topics, which are learned from the collection by an expectation...
Alexander Hinneburg, Hans-Henning Gabriel, Andr&eg...
CSL
2004
Springer
15 years 3 months ago
Contemporaneous text as side-information in statistical language modeling
We propose new methods to exploit contemporaneous text, such as on-line news articles, to improve language models for automatic speech recognition and other natural language proce...
Sanjeev Khudanpur, Woosung Kim