Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

180

WWW
2010
ACM

246views Internet Technology» more WWW 2010»

Web-scale k-means clustering

16 years 2 months ago

Web-scale k-means clustering

Download www.eecs.tufts.edu

We present two modiﬁcations to the popular k-means clustering algorithm to address the extreme requirements for latency, scalability, and sparsity encountered in user-facing web applications. First, we propose the use of mini-batch optimization for k-means clustering. This reduces computation cost by orders of magnitude compared to the classic batch algorithm while yielding signiﬁcantly better solutions than online stochastic gradient descent. Second, we achieve sparsity with projected gradient descent, and give a fast ǫaccurate projection onto the L1-ball. Source code is freely available: http://code.google.com/p/sofia-ml Categories and Subject Descriptors I.5.3 [Computing Methodologies]: Pattern Recognition— Clustering General Terms Algorithms, Performance, Experimentation Keywords unsupervised clustering, scalability, sparse solutions

D. Sculley

Real-time Traffic

Gradient Descent | Internet Technology | K-means Clustering | K-means Clustering Algorithm | WWW 2010 |

claim paper

Related Content

» Fast kMeans Algorithms with Constant Approximation

» kMeans Projective Clustering

» Methods of Decreasing the Number of Support Vectors via kMean Clustering

» Stability Yields a PTAS for kMedian and kMeans Clustering

» Fast Algorithms for Constant Approximation kMeans Clustering

» A novel unsupervised classification approach for network anomaly detection by kMeans clust...

» WorstCase and Smoothed Analysis of kMeans Clustering with Bregman Divergences

» Stability of k Means Clustering

» An Efficient kMeans Clustering Algorithm Analysis and Implementation

» The Effectiveness of LloydType Methods for the kMeans Problem

Post Info
More Details (n/a)

Added	14 May 2010
Updated	14 May 2010
Type	Conference
Year	2010
Where	WWW
Authors	D. Sculley

Comments (0)