Sciweavers

572 search results - page 24 / 115
» Winnowing-based text clustering
SIGIR
2006
ACM
14 years 1 month ago
Feature diversity in cluster ensembles for robust document clustering
The performance of document clustering systems depends on employing optimal text representations, which are not only difficult to determine beforehand, but also may vary from one ...
Xavier Sevillano, Germán Cobo, Francesc Al...
DEXAW
2007
IEEE
13 years 11 months ago
Generating a Topic Hierarchy from Dialect Texts
We built a system for the automatic creation of a text-based topic hierarchy, meant to be used in a geographically defined community. This poses two main problems. First, the appea...
Wim De Smet, Marie-Francine Moens
ACL
2008
13 years 9 months ago
Learning Document-Level Semantic Properties from Free-Text Annotations
This paper demonstrates a new method for leveraging unstructured annotations to infer semantic document properties. We consider the domain of product reviews, which are often anno...
S. R. K. Branavan, Harr Chen, Jacob Eisenstein, Re...
DEXAW
2008
IEEE
14 years 2 months ago
Proximity Estimation and Hardness of Short-Text Corpora
In this work, we investigate the relative hardness of short-text corpora in clustering problems and how this hardness relates to traditional similarity measures. Our appr...
Marcelo Luis Errecalde, Diego Ingaramo, Paolo Ross...
ACL
1998
13 years 9 months ago
Terminological Variation, a Means of Identifying Research Topics from Texts
After extracting terms from a corpus of titles and abstracts in English, syntactic variation relations are identified amongst them in order to detect research topics. Three types of synta...
Fidelia Ibekwe-Sanjuan