Sciweavers

182 search results - page 14 / 37
» Probabilistic Document Length Priors for Language Models
ICDM
2007
IEEE
14 years 1 month ago
Bayesian Folding-In with Dirichlet Kernels for PLSI
Probabilistic latent semantic indexing (PLSI) represents documents of a collection as mixture proportions of latent topics, which are learned from the collection by an expectation...
Alexander Hinneburg, Hans-Henning Gabriel, Andr&eg...
KDD
2007
ACM
14 years 8 months ago
Detecting research topics via the correlation between graphs and texts
In this paper we address the problem of detecting topics in large-scale linked document collections. Recently, topic detection has become a very active area of research due to its...
Yookyung Jo, Carl Lagoze, C. Lee Giles
CIKM
2005
Springer
14 years 1 month ago
Document quality models for web ad hoc retrieval
The quality of document content, which is an issue that is usually ignored for the traditional ad hoc retrieval task, is a critical issue for Web search. Web pages have a huge var...
Yun Zhou, W. Bruce Croft
CIKM
2008
Springer
13 years 9 months ago
A generative retrieval model for structured documents
Structured documents contain elements defined by the author(s) and annotations assigned by other people or processes. Structured documents pose challenges for probabilistic retrie...
Le Zhao, Jamie Callan
CIKM
2008
Springer
13 years 9 months ago
Modeling document features for expert finding
We argue that expert finding is sensitive to multiple document features in an organization, and therefore, can benefit from the incorporation of these document features. We propos...
Jianhan Zhu, Dawei Song, Stefan M. Rüger, Xia...