A General Framework for Increasing the Robustness of PCA-Based Correlation Clustering Algorithms

Abstract. Most correlation clustering algorithms rely on principal component analysis (PCA) as a correlation analysis tool. The correlation of each cluster is learned by applying PCA to a set of sample points. Since PCA is rather sensitive to outliers, if a small fraction of these points does not correspond to the correct correlation of the cluster, the algorithms are usually misled or even fail to detect the correct results. In this paper, we evaluate the influence of outliers on PCA and propose a general framework for increasing the robustness of PCA in order to determine the correct correlation of each cluster. We further show how our framework can be applied to PCA-based correlation clustering algorithms. A thorough experimental evaluation shows the benefit of our framework on several synthetic and real-world data sets.
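
The sensitivity described in the abstract can be illustrated with a small numerical sketch. It is not taken from the paper: the synthetic data, the helper names `principal_direction` and `angle_to`, and the Gaussian distance weighting are all assumptions chosen for illustration, and the weighting is only one simple way to down-weight outliers, not necessarily the authors' framework. The sketch fits the strongest principal direction to points lying along y = 2x, once with plain PCA and once with a distance-weighted covariance matrix, after adding five outliers.

```python
import numpy as np

def principal_direction(points, weights=None):
    """Unit eigenvector of the (optionally weighted) covariance matrix
    that belongs to the largest eigenvalue."""
    points = np.asarray(points, dtype=float)
    if weights is None:
        weights = np.ones(len(points))
    weights = np.asarray(weights, dtype=float)
    mean = np.average(points, axis=0, weights=weights)
    centered = points - mean
    cov = (centered * weights[:, None]).T @ centered / weights.sum()
    _, eigenvectors = np.linalg.eigh(cov)  # eigenvalues in ascending order
    return eigenvectors[:, -1]

def angle_to(direction, reference):
    """Angle in degrees between two directions, ignoring their sign."""
    cos = abs(direction @ reference)
    cos /= np.linalg.norm(direction) * np.linalg.norm(reference)
    return float(np.degrees(np.arccos(np.clip(cos, 0.0, 1.0))))

# Hypothetical data: 100 points close to the line y = 2x, plus 5 outliers
# that follow a different (roughly y = -x) correlation.
rng = np.random.default_rng(0)
t = rng.uniform(-1.0, 1.0, size=100)
cluster = np.column_stack([t, 2.0 * t]) + rng.normal(scale=0.02, size=(100, 2))
outliers = np.array([[3.0, -3.0], [3.5, -2.5], [-3.0, 3.0],
                     [-3.5, 2.5], [4.0, -4.0]])
data = np.vstack([cluster, outliers])
true_direction = np.array([1.0, 2.0])

# Plain PCA: every point has the same influence.
plain_angle = angle_to(principal_direction(data), true_direction)

# Assumed weighting: Gaussian in the distance to the coordinate-wise median,
# with the median distance as the scale, so far-away points count very little.
center = np.median(data, axis=0)
dist = np.linalg.norm(data - center, axis=1)
weights = np.exp(-0.5 * (dist / np.median(dist)) ** 2)
robust_angle = angle_to(principal_direction(data, weights), true_direction)

print(f"plain PCA deviation:    {plain_angle:.2f} degrees")
print(f"weighted PCA deviation: {robust_angle:.2f} degrees")
```

In this setup fewer than 5% of the points are outliers, yet they pull the plain PCA direction noticeably away from the true one, while the weighted variant stays close to it. The choice of weight function and scale is an assumption of this sketch and would need tuning for real data.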

Hans-Peter Kriegel, Peer Kröger, Erich Schubert, Arthur Zimek

Correct Correlation | Correlation Analysis Tool | Correlation Clustering Algorithms | Database | SSDBM 2008 |

Post Info

Added	01 Jun 2010
Updated	01 Jun 2010
Type	Conference
Year	2008
Where	SSDBM
Authors	Hans-Peter Kriegel, Peer Kröger, Erich Schubert, Arthur Zimek
