Clustering with Lower Bound on Similarity

15 years 11 months ago

Download www.cs.rpi.edu

We propose a new method, called SimClus, for clustering with lower bound on similarity. Instead of accepting k the number of clusters to ﬁnd, the alternative similarity-based approach imposes a lower bound on the similarity between an object and its corresponding cluster representative (with one representative per cluster). SimClus achieves a O(log n) approximation bound on the number of clusters, whereas for the best previous algorithm the bound can be as poor as O(n). Experiments on real and synthetic datasets show that our algorithm produces more than 40% fewer representative objects, yet oﬀers the same or better clustering quality. We also propose a dynamic variant of the algorithm, which can be eﬀectively used in an on-line setting.

Mohammad Al Hasan, Saeed Salem, Benjarath Pupacdi,

Real-time Traffic

Alternative Similarity-based Approach | Corresponding Cluster Representative | Data Mining | Lower Bound | PAKDD 2009 |

claim paper

» Interval Set Cluster Analysis A Reformulation

» Accelerated EMbased clustering of large data sets

» A quadratic lower bound for Rocchios similaritybased relevance feedback algorithm with a f...

» Horizontal Reduction InstanceLevel Dimensionality Reduction for Similarity Search in Large...

» Error Exponent for MultipleAccess ChannelsLower Bounds

» Indexing Spatially Sensitive Distance Measures Using Multiresolution Lower Bounds

» A Lower Bound for Primality

» Capacity Scaling of Wireless Networks With Inhomogeneous Node Density Lower Bounds

Post Info
More Details (n/a)

Added	26 Jul 2010
Updated	26 Jul 2010
Type	Conference
Year	2009
Where	PAKDD
Authors	Mohammad Al Hasan, Saeed Salem, Benjarath Pupacdi, Mohammed J. Zaki

Comments (0)

Sciweavers

Clustering with Lower Bound on Similarity

Alternative Similarity-based Approach | Corresponding Cluster Representative | Data Mining | Lower Bound | PAKDD 2009 |

Explore & Download

Productivity Tools

Sciweavers