Sciweavers

Keyword-based document clustering
ADL
2000
Springer
Clustering and Identifying Temporal Trends in Document Databases
We introduce a simple and efficient method for clustering and identifying temporal trends in hyper-linked document databases. Our method can scale to large datasets because it ex...
Alexandrin Popescul, Gary William Flake, Steve Law...
ICDM
2006
IEEE
High Quality, Efficient Hierarchical Document Clustering Using Closed Interesting Itemsets
High dimensionality remains a significant challenge for document clustering. Recent approaches used frequent itemsets and closed frequent itemsets to reduce dimensionality, and to...
Hassan H. Malik, John R. Kender
TKDE
2011
Locally Consistent Concept Factorization for Document Clustering
Previous studies have demonstrated that document clustering performance can be improved significantly in lower dimensional linear subspaces. Recently, matrix factorization base...
Deng Cai, Xiaofei He, Jiawei Han
DIAL
2006
IEEE
Tree clustering for layout-based document image retrieval
We describe a system for the retrieval on the basis of layout similarity of document images belonging to collections stored in digital libraries. Layout regions are extracted and ...
Simone Marinai, Emanuele Marino, Giovanni Soda
WWW
2006
ACM
Using proportional transportation similarity with learned element semantics for XML document clustering
This paper proposes a novel approach to measuring XML document similarity by taking into account the semantics between XML elements. The motivation of the proposed approach is to ...
Xiaojun Wan, Jianwu Yang