Consistent Minimization of Clustering Objective Functions

15 years 8 months ago

Download www.kyb.tuebingen.mpg.de

Clustering is often formulated as a discrete optimization problem. The objective is to ﬁnd, among all partitions of the data set, the best one according to some quality measure. However, in the statistical setting where we assume that the ﬁnite data set has been sampled from some underlying space, the goal is not to ﬁnd the best partition of the given sample, but to approximate the true partition of the underlying space. We argue that the discrete optimization approach usually does not achieve this goal. As an alternative, we suggest the paradigm of “nearest neighbor clustering”. Instead of selecting the best out of all partitions of the sample, it only considers partitions in some restricted function class. Using tools from statistical learning theory we prove that nearest neighbor clustering is statistically consistent. Moreover, its worst case complexity is polynomial by construction, and it can be implemented with small average case complexity using branch and bound.

Ulrike von Luxburg, Sébastien Bubeck, Stefa

Real-time Traffic

Data Set | Discrete Optimization | Information Technology | Nearest Neighbor Clustering | NIPS 2007 |

claim paper

» Minimization of Locally Defined Submodular Functions by Optimal Soft Arc Consistency

» Segmentation of nonrigid video objects using long term temporal consistency

» Gaussian Mixture Model with Local Consistency

» Correlation Clustering Revisited The True Cost of Error Minimization Problems

» Online adaptive clustering in a decision tree framework

» Agglomerative Fuzzy KMeans Clustering Algorithm with Selection of Number of Clusters

» Optimization and Simplification of Hierarchical Clusterings

» Is Objective Function the Silver Bullet A Case Study of Community Detection Algorithms on ...

Post Info
More Details (n/a)

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2007
Where	NIPS
Authors	Ulrike von Luxburg, Sébastien Bubeck, Stefanie Jegelka, Michael Kaufmann

Comments (0)

Sciweavers

Consistent Minimization of Clustering Objective Functions

Data Set | Discrete Optimization | Information Technology | Nearest Neighbor Clustering | NIPS 2007 |

Explore & Download

Productivity Tools

Sciweavers