Parallel Spectral Clustering in Distributed Systems

14 years 9 months ago

Download www.csie.ntu.edu.tw

Spectral clustering algorithms have been shown to be more effective in ﬁnding clusters than some traditional algorithms such as k-means. However, spectral clustering suffers from a scalability problem in both memory use and computational time when the size of a data set is large. To perform clustering on large data sets, we investigate two representative ways of approximating the dense similarity matrix. We compare one approach by sparsifying the matrix with another by the Nyström method. We then pick the strategy of sparsifying the matrix via retaining nearest neighbors and investigate its parallelization. We parallelize both memory use and computation on distributed computers. Through an empirical study on a document data set of 193, 844 instances and a photo data set of 2, 121, 863, we show that our parallel algorithm can effectively handle large problems.

Wen-Yen Chen, Yangqiu Song, Hongjie Bai, Chih-Jen

Real-time Traffic

Operations Research | PAMI 2011 | Spectral Clustering | Spectral Clustering Algorithms | Spectral Clustering Suffers |

claim paper

» Runtime system support for softwareguided disk power management

» Profiling services for resource optimization and capacity planning in distributed systems

» Cplant Runtime System Support for MultiProcessor and Heterogeneous Compute Nodes

» Autonomic power and performance management for computing systems

» FailureAtomic File Access in the Slice Interposed Network Storage System

» DCR A fully transparent checkpointrestart framework for distributed systems

» A Kernel Running in a DSM Design Aspects of a Distributed Operating System

» A global operating system for HPC clusters

Post Info
More Details (n/a)

Added	14 May 2011
Updated	14 May 2011
Type	Journal
Year	2011
Where	PAMI
Authors	Wen-Yen Chen, Yangqiu Song, Hongjie Bai, Chih-Jen Lin, Edward Y. Chang

Comments (0)

Sciweavers

Parallel Spectral Clustering in Distributed Systems

Operations Research | PAMI 2011 | Spectral Clustering | Spectral Clustering Algorithms | Spectral Clustering Suffers |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers