SIGIR
2004
ACM

Document clustering via adaptive subspace iteration

Document clustering has long been an important problem in information retrieval. In this paper, we present a new clustering algorithm, ASI, which explicitly models the subspace structure associated with each cluster. ASI simultaneously performs data reduction and subspace identification via an iterative alternating optimization procedure. Motivated by this optimization procedure, we then provide a novel method to determine the number of clusters. We also discuss the connections of ASI with various existing clustering approaches. Finally, extensive experimental results on real data sets show the effectiveness of the ASI algorithm.

Categories and Subject Descriptors: H.3.3 [Information Search and Retrieval]: Clustering; I.2 [Artificial Intelligence]: Learning; I.5 [Pattern Recognition]: Applications

General Terms: Algorithms, Experimentation, Measurement, Performance, Theory, Verification

Keywords: document clustering, adaptive subspace identification, alternating optimizatio...

Tao Li, Sheng Ma, Mitsunori Ogihara

Document Clustering | Optimization Procedure | SIGIR 2004 | Subspace Identification

Related Content

» Discriminative K-means for Clustering

» Optimal Display Adaptation of Iconic Document Visualizations via BFOS-Style Tree Pruning

» Multi-view clustering via canonical correlation analysis

» High-diagnosability online built-in self-test of FPGAs via iterative bootstrapping

» Dual diffusion model of spreading activation for content-based image retrieval

» Unsupervised language model adaptation via topic modeling based on named entity hypotheses

» Pairwise Constraints-Guided Non-negative Matrix Factorization for Document Clustering

» Integrating Element and Term Semantics for Similarity-Based XML Document Clustering

» Scalable Balanced Model-based Clustering

Post Info

Added	30 Jun 2010
Updated	30 Jun 2010
Type	Conference
Year	2004
Where	SIGIR
Authors	Tao Li, Sheng Ma, Mitsunori Ogihara
