Sciweavers

Search results for: Document clustering with committees
ICML
1998
IEEE
Employing EM and Pool-Based Active Learning for Text Classification
This paper shows how a text classifier's need for labeled training documents can be reduced by taking advantage of a large pool of unlabeled documents. We modify the Query-by...
Andrew McCallum, Kamal Nigam
SIGIR
1998
ACM
Web Document Clustering: A Feasibility Demonstration
Users of Web search engines are often forced to sift through the long ordered list of document “snippets” returned by the engines. The IR community has explored document cluste...
Oren Zamir, Oren Etzioni
IRAL
2003
ACM
Keyword-based document clustering
Document clustering is an aggregation of related documents to a cluster based on the similarity evaluation task between documents and the representatives of clusters. Terms and t...
Seung-Shik Kang
ECIR
2007
Springer
A Hierarchical Consensus Architecture for Robust Document Clustering
A major problem encountered by text clustering practitioners is the difficulty of determining a priori which is the optimal text representation and clustering technique f...
Xavier Sevillano, Germán Cobo, Francesc Al...
LWA
2004
Experiments in Term Weighting and Keyword Extraction in Document Clustering
We study methods to initialize or bias different clustering methods using prior information about the "importance" of a keyword w.r.t. the whole document collection or a...
Christian Borgelt, Andreas Nürnberger