Optimal Solutions for Sparse Principal Component Analysis


Given a sample covariance matrix, we examine the problem of maximizing the variance explained by a linear combination of the input variables while constraining the number of nonzero coefficients in this combination. This is known as sparse principal component analysis and has a wide array of applications in machine learning and engineering. We formulate a new semidefinite relaxation to this problem and derive a greedy algorithm that computes a full set of good solutions for all target numbers of nonzero coefficients, with total complexity O(n^3), where n is the number of variables. We then use the same relaxation to derive sufficient conditions for global optimality of a solution, which can be tested in O(n^3) per pattern. We discuss applications in subset selection and sparse recovery and show on artificial examples and biological data that our algorithm does provide globally optimal solutions in many cases.
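
To make the greedy idea in the abstract concrete, here is a minimal Python sketch of a naive forward selection over supports. It is an illustration under stated assumptions, not the authors' algorithm: the function name greedy_sparse_pca and its return format are hypothetical, and this version recomputes an eigenvalue decomposition for every candidate variable, so it is costlier than the O(n^3) total the abstract reports. For a fixed support, the best explained variance is the largest eigenvalue of the corresponding principal submatrix of the covariance matrix, which is what the inner loop evaluates.

```python
import numpy as np


def greedy_sparse_pca(sigma):
    """Naive forward greedy selection for sparse PCA (illustrative sketch).

    sigma: symmetric positive semidefinite covariance matrix, shape (n, n).
    Returns a list of (support, variance) pairs, one per cardinality 1..n,
    where variance is the largest eigenvalue of sigma restricted to support.
    """
    n = sigma.shape[0]
    support = []                 # indices selected so far
    remaining = set(range(n))    # indices not yet selected
    path = []
    for _ in range(n):
        best_val, best_idx = -np.inf, None
        for j in sorted(remaining):
            idx = support + [j]
            # eigvalsh returns eigenvalues in ascending order,
            # so the last entry is the largest one.
            val = np.linalg.eigvalsh(sigma[np.ix_(idx, idx)])[-1]
            if val > best_val:
                best_val, best_idx = val, j
        support.append(best_idx)
        remaining.remove(best_idx)
        path.append((list(support), float(best_val)))
    return path
```

Calling greedy_sparse_pca(sigma) on an n-by-n covariance matrix returns one candidate support per target cardinality, so the whole path from 1 to n nonzero coefficients comes from a single run. The supports are nested by construction, and nothing in this sketch certifies that any of them is globally optimal; that is the role of the sufficient conditions mentioned in the abstract.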

Alexandre d'Aspremont, Francis R. Bach, Laurent El Ghaoui


CORR 2007 | Education | Non Zero Coefficients | Sample Covariance Matrix | Sparse Principal Component


Post Info

Added	13 Dec 2010
Updated	13 Dec 2010
Type	Journal
Year	2007
Where	CORR
Authors	Alexandre d'Aspremont, Francis R. Bach, Laurent El Ghaoui
