PBIRCH: A Scalable Parallel Clustering algorithm for Incremental Data

16 years 19 days ago

Download www.cs.gsu.edu

We present a parallel version of BIRCH with the objective of enhancing the scalability without compromising on the quality of clustering. The incoming data is distributed in a cyclic manner (or block cyclic manner if the data is bursty) to balance the load among processors. The algorithm is implemented on a message passing share-nothing model. Experiments show that for very large data sets the algorithm scales nearly linearly with the increasing number of processors. Experiments also show that clusters obtained by PBIRCH are comparable to those obtained using BIRCH.

Ashwani Garg, Ashish Mangla, Neelima Gupta, Vasudh

Real-time Traffic

Block Cyclic Manner | Cyclic Manner | Database | IDEAS 2006 | Incoming Data |

claim paper

» A Scalable Parallel Algorithm for SelfOrganizing Maps with Applications to Sparse Data Min...

» A platform for scalable onepass analytics using MapReduce

» A Scalable Parallel Subspace Clustering Algorithm for Massive Data Sets

» A Parallel Algorithm for Incremental Compact Clustering

» Parallelizing Skyline Queries for Scalable Distribution

» A Scalable Collaborative Filtering Framework Based on CoClustering

» ADWICE Anomaly Detection with RealTime Incremental Clustering

» Clustering Large Attributed Graphs An Efficient Incremental Approach

Post Info
More Details (n/a)

Added	11 Jun 2010
Updated	11 Jun 2010
Type	Conference
Year	2006
Where	IDEAS
Authors	Ashwani Garg, Ashish Mangla, Neelima Gupta, Vasudha Bhatnagar

Comments (0)

Sciweavers

PBIRCH: A Scalable Parallel Clustering algorithm for Incremental Data

Block Cyclic Manner | Cyclic Manner | Database | IDEAS 2006 | Incoming Data |

Explore & Download

Productivity Tools

Sciweavers