Feature diversity in cluster ensembles for robust document clustering

15 years 1 months ago

Download serpens.salleurl.edu

The performance of document clustering systems depends on employing optimal text representations, which are not only diﬃcult to determine beforehand, but also may vary from one clustering problem to another. As a ﬁrst step towards building robust document clusterers, a strategy based on feature diversity and cluster ensembles is presented in this work. Experiments conducted on a binary clustering problem show that our method is robust to near-optimal model order selection and able to detect constructive interactions between diﬀerent document representations in the test bed. Categories and Subject Descriptors I.2.7 [Artiﬁcial Intelligence]: Natural Language Processing—Text Analysis; I.5.3 [Pattern Recognition]: Clustering—Algorithms General Terms Algorithms, Design, Experimentation, Performance Keywords Document clustering, feature extraction, cluster ensembles

Xavier Sevillano, Germán Cobo, Francesc Al&

Real-time Traffic

Clustering Problem | Document Clustering | Document Clustering Systems | SIGIR 2006 |

claim paper

Post Info
More Details (n/a)

Added	14 Jun 2010
Updated	14 Jun 2010
Type	Conference
Year	2006
Where	SIGIR
Authors	Xavier Sevillano, Germán Cobo, Francesc Alías, Joan Claudi Socoró

Comments (0)

Sciweavers

Feature diversity in cluster ensembles for robust document clustering

Clustering Problem | Document Clustering | Document Clustering Systems | SIGIR 2006 |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers