Sciweavers

» Winnowing-based text clustering
AI
2008
Springer
13 years 9 months ago
A Statistical Model for Topic Segmentation and Clustering
This paper presents a statistical model for discovering topical clusters of words in unstructured text. The model uses a hierarchical Bayesian structure and it is also able to iden...
M. Mahdi Shafiei, Evangelos E. Milios
SEKE
2010
Springer
13 years 6 months ago
Incremental Construction of Topic Hierarchies using Hierarchical Term Clustering
Topic hierarchies are very useful for managing, searching, and browsing large repositories of text documents. Hierarchical clustering methods are used to support the constructi...
Ricardo M. Marcacini, Solange O. Rezende
ICDM
2008
IEEE
14 years 2 months ago
Clustering Documents with Active Learning Using Wikipedia
Wikipedia has been applied as a background knowledge base to various text mining problems, but very few attempts have been made to utilize it for document clustering. In this pape...
Anna Huang, David N. Milne, Eibe Frank, Ian H. Wit...
IMCSIT
2010
13 years 5 months ago
Evaluation of Clustering Algorithms for Polish Word Sense Disambiguation
Word Sense Disambiguation in text is still a difficult problem as the best supervised methods require laborious and costly manual preparation of training data. Thus, this work focu...
Bartosz Broda, Wojciech Mazur
KDD
2009
ACM
14 years 8 months ago
Exploiting Wikipedia as external knowledge for document clustering
In traditional text clustering methods, documents are represented as "bags of words" without considering the semantic information of each document. For instance, if two ...
Xiaohua Hu, Xiaodan Zhang, Caimei Lu, E. K. Park, ...