Sciweavers

EMNLP
2009
13 years 5 months ago
Unsupervised morphological segmentation and clustering with document boundaries
Many approaches to unsupervised morphology acquisition incorporate the frequency of character sequences with respect to each other to identify word stems and affixes. This typical...
Taesun Moon, Katrin Erk, Jason Baldridge
EMNLP
2009
13 years 5 months ago
Multilingual Spectral Clustering Using Document Similarity Propagation
We present a novel approach for multilingual document clustering using only comparable corpora to achieve cross-lingual semantic interoperability. The method models document colle...
Dani Yogatama, Kumiko Tanaka-Ishii
IMCSIT
2010
13 years 5 months ago
Using Self Organizing Map to Cluster Arabic Crime Documents
This paper presents a system that combines two text mining techniques: information extraction and clustering. A rule-based approach is used to perform the information extraction tas...
Meshrif Alruily, Aladdin Ayesh, Abdulsamad Al-Marg...
CLEF
2010
Springer
13 years 8 months ago
Cross-document Coreference for WePS
Good clustering performance depends on the quality of the distance function used to assess similarity. In this paper we propose a pairwise document coreference model to improve pe...
Iustin Dornescu, Constantin Orasan, Tatiana Lesnik...
WIDM
2003
ACM
14 years 28 days ago
Clustering documents in a web directory
Hierarchical categorization of documents is a task receiving growing interest due to the widespread proliferation of topic hierarchies for text documents. The worst problem of hie...
Giordano Adami, Paolo Avesani, Diego Sona