Sciweavers

» A flocking based algorithm for document clustering analysis
EEE
2005
IEEE
14 years 1 months ago
Learning the Kernel Matrix for XML Document Clustering
The rapid growth of XML adoption has urged for the need of a proper representation for semi-structured documents, where the document structural information has to be taken into ac...
Jianwu Yang, William Kwok-Wai Cheung, Xiaoou Chen
DMIN
2006
13 years 9 months ago
Reverse Tree Clustering
Common document clustering algorithms utilize models that either divide a corpus into smaller clusters or gather individual documents into clusters. Hierarchical Agglomerative Clus...
Casey Bartman, Jamal R. Alsabbagh
IADIS
2004
13 years 9 months ago
'surfing for knowledge' finding semantically similar Web clusters
In this paper we present our technique for finding semantically similar clusters within web documents obtained from a set of queries retrieved from the Google search engine. This ...
David Cleary, Diarmuid O'Donoghue
ICDAR
2009
IEEE
13 years 5 months ago
Analysis of Book Documents' Table of Content Based on Clustering
Table of contents (TOC) recognition has attracted a great deal of attention in recent years. After reviewing the merits and drawbacks of the existing TOC recognition methods, we h...
Liangcai Gao, Zhi Tang, Xiaofan Lin, Xin Tao, Yimi...
ICDE
2007
IEEE
14 years 2 months ago
Document Representation and Dimension Reduction for Text Clustering
Increasingly large text datasets and the high dimensionality associated with natural language create a great challenge in text mining. In this research, a systematic study is cond...
M. Mahdi Shafiei, Singer Wang, Roger Zhang, Evange...