Sciweavers

CIDM
2009
IEEE

An architecture and algorithms for multi-run clustering

14 years 4 months ago
An architecture and algorithms for multi-run clustering
—This paper addresses two main challenges for clustering which require extensive human effort: selecting appropriate parameters for an arbitrary clustering algorithm and identifying alternative clusters. We propose an architecture and a concrete system MR-CLEVER for multi-run clustering that integrates active learning with clustering algorithms. The key hypothesis of this work is that better clustering results can be obtained by combining clusters that originate from multiple runs of clustering algorithms. By defining states that represent parameter settings of a clustering algorithm, the proposed architecture actively learns a state utility function. The utility of a parameter setting is assessed based on clustering run-time, quality and novelty of the obtained clusters. Furthermore, the utility function plays an important role in guiding the clustering algorithm to seek novel solutions. Cluster novelty measures are introduced for this purpose. Finally, we also contribute a cluster ...
Rachsuda Jiamthapthaksin, Christoph F. Eick, Vadee
Added 21 Jul 2010
Updated 21 Jul 2010
Type Conference
Year 2009
Where CIDM
Authors Rachsuda Jiamthapthaksin, Christoph F. Eick, Vadeerat Rinsurongkawong
Comments (0)