K-means clustering of proportional data using L1 distance


We present a new L1-distance-based k-means clustering algorithm for clustering high-dimensional proportional vectors. The new algorithm explicitly incorporates proportionality constraints when computing the cluster centroids, which results in reduced L1 error rates. We compare the new method with two competing methods: an approximate L1-distance k-means algorithm, in which the centroid is estimated using cluster means, and a median L1 k-means algorithm, in which the centroid is estimated using cluster medians and proportionality constraints are imposed by normalization in a second step. An application to clustering projects based on the distribution of labor hours by skill illustrates the advantages of the new algorithm.
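The two competing methods named in the abstract can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions, not the authors' implementation, and it does not attempt the new constrained-centroid algorithm, whose details the abstract does not give. The function name `l1_kmeans`, its parameters, the random initialisation, and the fallback for an all-zero median are hypothetical choices made for this example. Rows of `X` are assumed to be non-negative and to sum to 1.

```python
import numpy as np

def l1_kmeans(X, k, centroid="median", n_iter=50, seed=0):
    """Sketch of L1-distance k-means for proportional rows (each sums to 1).

    centroid="mean":   centroid is the cluster mean (approximate L1 variant).
    centroid="median": centroid is the coordinate-wise cluster median,
                       renormalised to sum to 1 in a second step.
    """
    rng = np.random.default_rng(seed)
    # Assumption: initialise centroids from k distinct data points.
    centers = X[rng.choice(len(X), size=k, replace=False)].copy()
    labels = np.full(len(X), -1)
    for _ in range(n_iter):
        # Assignment step: L1 distance from every point to every centroid.
        dists = np.abs(X[:, None, :] - centers[None, :, :]).sum(axis=2)
        new_labels = dists.argmin(axis=1)
        if np.array_equal(new_labels, labels):
            break  # assignments stopped changing
        labels = new_labels
        # Update step: recompute each non-empty cluster's centroid.
        for j in range(k):
            members = X[labels == j]
            if len(members) == 0:
                continue  # keep the previous centroid for an empty cluster
            if centroid == "mean":
                # A mean of proportional vectors already sums to 1.
                centers[j] = members.mean(axis=0)
            else:
                med = np.median(members, axis=0)
                total = med.sum()
                # Assumption: fall back to the mean if the median is all zeros.
                centers[j] = med / total if total > 0 else members.mean(axis=0)
    return labels, centers
```

Calling `l1_kmeans(X, k=3, centroid="median")` returns one label per row and a `k`-row centroid matrix whose rows sum to 1. The coordinate-wise median minimises the L1 error within a cluster only when no sum constraint applies, so renormalising it afterwards generally moves the centroid away from that minimiser; this is the gap the abstract says the new algorithm closes by building the constraint into the centroid computation.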

Bonnie K. Ray, Hisashi Kashima, Jianying Hu, Moninder Singh


Approximate L1-distance K-means | Computer Vision | ICPR 2008 | K-means Clustering Algorithm | Median L1 K-means


Added	05 Nov 2009
Updated	06 Nov 2009
Type	Conference
Year	2008
Where	ICPR
Authors	Bonnie K. Ray, Hisashi Kashima, Jianying Hu, Moninder Singh
