Are approximation algorithms for consensus clustering worthwhile?

14 years 5 months ago

Download siam.org

Consensus clustering has emerged as one of the principal clustering problems in the data mining community. In recent years the theoretical computer science community has generated a number of approximation algorithms for consensus clustering and similar problems. These algorithms run in polynomial time, with performance guaranteed to be at most a certain factor worse than optimal. We investigate the feasibility of the approximation algorithms, in an attempt to link data-mining and theoretical research. On realistic data sets, algorithms with quadratic running times are impractical. Unfortunately these and even worse running times are typical of approximation algorithms. To circumvent this, we sample from the data, run the “slow” algorithms on the sample, and then build a consensus clustering from the seed sample clustering, using a range of techniques. These unsampling techniques are in fact almost as good at creating consensus partitionings as the approximation and data-mining al...

Michael Bertolacci, Anthony Wirth

Real-time Traffic

Approximation Algorithms | Consensus Clustering | Data Mining | Principal Clustering Problems | SDM 2007 |

claim paper

Post Info
More Details (n/a)

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2007
Where	SDM
Authors	Michael Bertolacci, Anthony Wirth

Comments (0)

Sciweavers

Are approximation algorithms for consensus clustering worthwhile?

Approximation Algorithms | Consensus Clustering | Data Mining | Principal Clustering Problems | SDM 2007 |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers