Efficient Parallel Hierarchical Clustering

15 years 10 months ago

Download www.eecs.northwestern.edu

Hierarchical agglomerative clustering (HAC) is a common clustering method that outputs a dendrogram showing all N levels of agglomerations where N is the number of objects in the data set. High time and memory complexities are some of the major bottlenecks in its application to real-world problems. In the literature parallel algorithms are proposed to overcome these limitations. But, as this paper shows, existing parallel HAC algorithms are inefficient due to ineffective partitioning of the data. We first show how HAC follows a rule where most agglomerations have very small dissimilarity and only a small portion towards the end have large dissimilarity. Partially overlapping partitioning (POP) exploits this principle and obtains efficient yet accurate HAC algorithms. The total number of dissimilarities is reduced by a factor close to the number of cells in the partition. We present pPOP, the parallel version of POP, that is implemented on a shared memory multiprocessor architecture. Ex...

Manoranjan Dash, Simona Petrutiu, Peter Scheuerman

Real-time Traffic

Distributed And Parallel Computing | EUROPAR 2004 | HAC Algorithms | Parallel Algorithms | Parallel Hac Algorithms |

claim paper

» Distributed EnergyEfficient Hierarchical Clustering for Wireless Sensor Networks

» BandwidthEfficient Collective Communication for Clustered Wide Area Systems

» Improving system efficiency through scheduling and power management

» Performance Analysis and Optimization of Parallel Scientific Applications on CMP Cluster S...

» Evaluating memory energy efficiency in parallel IO workloads

» Hierarchical Bloom filter arrays HBA a novel scalable metadata management system for large...

» An Efficient Network API for inKernel Applications in Clusters

» An efficient hardwaresoftware approach to network fault tolerance with InfiniBand

Post Info
More Details (n/a)

Added	20 Aug 2010
Updated	20 Aug 2010
Type	Conference
Year	2004
Where	EUROPAR
Authors	Manoranjan Dash, Simona Petrutiu, Peter Scheuermann

Comments (0)

Sciweavers

Efficient Parallel Hierarchical Clustering

Distributed And Parallel Computing | EUROPAR 2004 | HAC Algorithms | Parallel Algorithms | Parallel Hac Algorithms |

Explore & Download

Productivity Tools

Sciweavers