A divide-and-merge methodology for clustering

15 years 7 months ago

Download www.cs.yale.edu

We present a divide-and-merge methodology for clustering a set of objects that combines a top-down "divide" phase with a bottom-up "merge" phase. In contrast, previous algorithms use either top-down or bottom-up methods for constructing a hierarchical clustering or produce a flat clustering using local search (e.g. k-means). Our divide phase produces a tree whose leaves are the elements of the set. For this phase, we suggest an efficient spectral algorithm. The merge phase quickly finds the optimal partition that respects the tree for many natural objective functions, e.g., k-means, min-diameter, min-sum, correlation clustering, etc. We present a metasearch engine that clusters results from web searches. We also give empirical results on textbased data where the algorithm performs better than or competitively with existing clustering algorithms.

David Cheng, Santosh Vempala, Ravi Kannan, Grant W

Real-time Traffic

Database | Divide Phase | Hierarchical Clustering | Merge Phase | PODS 2005 |

claim paper

Post Info
More Details (n/a)

Added	08 Dec 2009
Updated	08 Dec 2009
Type	Conference
Year	2005
Where	PODS
Authors	David Cheng, Santosh Vempala, Ravi Kannan, Grant Wang

Comments (0)

Sciweavers

A divide-and-merge methodology for clustering

Database | Divide Phase | Hierarchical Clustering | Merge Phase | PODS 2005 |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers