Sciweavers

PAKDD
2009
ACM

Clustering with Lower Bound on Similarity

14 years 4 months ago
Clustering with Lower Bound on Similarity
We propose a new method, called SimClus, for clustering with lower bound on similarity. Instead of accepting k the number of clusters to find, the alternative similarity-based approach imposes a lower bound on the similarity between an object and its corresponding cluster representative (with one representative per cluster). SimClus achieves a O(log n) approximation bound on the number of clusters, whereas for the best previous algorithm the bound can be as poor as O(n). Experiments on real and synthetic datasets show that our algorithm produces more than 40% fewer representative objects, yet offers the same or better clustering quality. We also propose a dynamic variant of the algorithm, which can be effectively used in an on-line setting.
Mohammad Al Hasan, Saeed Salem, Benjarath Pupacdi,
Added 26 Jul 2010
Updated 26 Jul 2010
Type Conference
Year 2009
Where PAKDD
Authors Mohammad Al Hasan, Saeed Salem, Benjarath Pupacdi, Mohammed J. Zaki
Comments (0)