Sciweavers

572 search results - page 17 / 115
» Winnowing-based text clustering
DBISP2P
2008
Springer
13 years 9 months ago
Exploiting Distribution Skew for Scalable P2P Text Clustering
K-Means clustering is widely used in information retrieval and data mining. Distributed K-Means variants have already been proposed, but none of the past algorithms scales to large...
Odysseas Papapetrou, Wolf Siberski, Fabian Leitrit...
ECAI
2010
Springer
13 years 8 months ago
A Very Fast Method for Clustering Big Text Datasets
Large-scale text datasets have long eluded a family of particularly elegant and effective clustering methods that exploits the power of pair-wise similarities between data points ...
Frank Lin, William W. Cohen
ICDE
2007
IEEE
14 years 2 months ago
Document Representation and Dimension Reduction for Text Clustering
Increasingly large text datasets and the high dimensionality associated with natural language create a great challenge in text mining. In this research, a systematic study is cond...
M. Mahdi Shafiei, Singer Wang, Roger Zhang, Evange...
ICDAR
2011
IEEE
12 years 7 months ago
Graph Clustering-Based Ensemble Method for Handwritten Text Line Segmentation
Handwritten text line segmentation on real-world data presents significant challenges that cannot be overcome by any single technique. Given the diversity of approaches and the...
Vasant Manohar, Shiv Naga Prasad Vitaladevuni, Hua...
AUSAI
2005
Springer
14 years 1 months ago
Semantic Correlation Network Based Text Clustering
Text documents have sparse data spaces, and nearest neighbors may belong to different classes when using current existing proximity measures to describe the correlation ...
Shaoxu Song, Chunping Li