Clustering Trees with Instance Level Constraints

Abstract. Constrained clustering investigates how to incorporate domain knowledge into the clustering process. The domain knowledge takes the form of constraints that must hold on the set of clusters. We consider instance level constraints, such as must-link and cannot-link. This type of constraint has been successfully used in popular clustering algorithms, such as k-means and hierarchical agglomerative clustering. This paper shows how clustering trees can support instance level constraints. Clustering trees are decision trees that partition the instances into homogeneous clusters. Clustering trees provide a symbolic description for each cluster. To handle non-trivial constraint sets, we extend clustering trees to support disjunctive descriptions. The paper's main contribution is ClusILC, an efficient algorithm for building such trees. We present experiments comparing ClusILC to COP-k-means.
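To make the two constraint types named in the abstract concrete, here is a minimal sketch in Python. It is not ClusILC and not taken from the paper: it only checks which must-link pairs (two instances that should share a cluster) and cannot-link pairs (two instances that should be in different clusters) a given cluster assignment violates. The function name `violated_constraints`, the instance labels, and the sample data are all hypothetical and chosen for illustration.

```python
from typing import Dict, Hashable, Iterable, List, Tuple

Pair = Tuple[Hashable, Hashable]


def violated_constraints(
    assignment: Dict[Hashable, int],
    must_link: Iterable[Pair],
    cannot_link: Iterable[Pair],
) -> List[Tuple[str, Hashable, Hashable]]:
    """Return the instance level constraints that `assignment` violates.

    `assignment` maps each instance to a cluster id (hypothetical format).
    """
    violations = []
    # A must-link pair is violated when the two instances are in different clusters.
    for a, b in must_link:
        if assignment[a] != assignment[b]:
            violations.append(("must-link", a, b))
    # A cannot-link pair is violated when the two instances share a cluster.
    for a, b in cannot_link:
        if assignment[a] == assignment[b]:
            violations.append(("cannot-link", a, b))
    return violations


# Illustrative data only: four instances split into two clusters.
assignment = {"x1": 0, "x2": 0, "x3": 1, "x4": 1}
must_link = [("x1", "x2"), ("x2", "x3")]
cannot_link = [("x1", "x4"), ("x3", "x4")]

result = violated_constraints(assignment, must_link, cannot_link)
print(result)
```

A check like this could be used to score any clustering, whether produced by a tree or by a k-means variant, against a given constraint set. How ClusILC itself uses the constraints while building a tree is described in the paper, not here.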

Jan Struyf, Saso Dzeroski

Clustering Process | Constrained Clustering | ECML 2007 | Instance Level Constraints | Machine Learning

Post Info

Added	07 Jun 2010
Updated	07 Jun 2010
Type	Conference
Year	2007
Where	ECML
Authors	Jan Struyf, Saso Dzeroski
