Sciweavers

808 search results - page 119 / 162
» Keyword-based document clustering
Sort
View
CIKM
2006
Springer
13 years 11 months ago
Multi-task text segmentation and alignment based on weighted mutual information
Text segmentation is important for text analysis, while text alignment is to determine shared sub-topics among similar documents. Multi-task text segmentation and alignment is the...
Bingjun Sun, Ding Zhou, Hongyuan Zha, John Yen
RANLP
2003
13 years 9 months ago
A framework for named entity recognition in the open domain
In this paper, a system for Named Entity Recognition in the Open domain (NERO) is described. It is concerned with recognition of various types of entity, types that will be approp...
Richard J. Evans
CORR
2007
Springer
117views Education» more  CORR 2007»
13 years 7 months ago
Dirac Notation, Fock Space and Riemann Metric Tensor in Information Retrieval Models
Using Dirac Notation as a powerful tool, we investigate the three classical Information Retrieval (IR) models and some their extensions. We show that almost all such models can be...
Xing M. Wang
SIGIR
2002
ACM
13 years 7 months ago
Language model for IR using collection information
In this paper, we explored how to use meta-data information in information retrieval task. We presented a new language model that is able to take advantage of the category informa...
Rong Jin, Luo Si, Alexander G. Hauptmann, James P....
JACM
2010
208views more  JACM 2010»
13 years 6 months ago
The nested chinese restaurant process and bayesian nonparametric inference of topic hierarchies
clustering of documents according to sharing of topics at multiple levels of abstraction. Given a corpus of documents, a posterior inference algorithm finds an approximation to a ...
David M. Blei, Thomas L. Griffiths, Michael I. Jor...