Sciweavers

359 search results - page 4 / 72
AIRS
2004
Springer
14 years 26 days ago
Automatic Word Clustering for Text Categorization Using Global Information
This paper presents a cluster-based text categorization system which uses class distributional clustering of words. We propose a new clustering model which considers the global in...
Wenliang Chen, Xingzhi Chang, Huizhen Wang, Jingbo...
HT
2000
ACM
13 years 11 months ago
Clustering hypertext with applications to web searching
Clustering separates unrelated documents and groups related documents, and is useful for discrimination, disambiguation, summarization, organization, and navigation of unstructure...
Dharmendra S. Modha, W. Scott Spangler
SDM
2003
SIAM
13 years 8 months ago
Hierarchical Document Clustering using Frequent Itemsets
A major challenge in document clustering is the extremely high dimensionality. For example, the vocabulary for a document set can easily be thousands of words. On the other hand, ...
Benjamin C. M. Fung, Ke Wang, Martin Ester
LWA
2004
13 years 8 months ago
Experiments in Term Weighting and Keyword Extraction in Document Clustering
We study methods to initialize or bias different clustering methods using prior information about the "importance" of a keyword w.r.t. the whole document collection or a...
Christian Borgelt, Andreas Nürnberger
KBSE
1999
IEEE
13 years 11 months ago
Automatic Software Clustering via Latent Semantic Analysis
The paper describes the initial results of applying Latent Semantic Analysis (LSA) to program source code and associated documentation. Latent Semantic Analysis is a corpus-based ...
Jonathan I. Maletic, Naveen Valluri