Sciweavers

572 search results - page 40 / 115
» Winnowing-based text clustering
Sort
View
CORR
2006
Springer
178views Education» more  CORR 2006»
13 years 10 months ago
A tool set for the quick and efficient exploration of large document collections
: We are presenting a set of multilingual text analysis tools that can help analysts in any field to explore large document collections quickly in order to determine whether the do...
Camelia Ignat, Bruno Pouliquen, Ralf Steinberger, ...
COLING
2008
13 years 11 months ago
Rank Distance as a Stylistic Similarity
In this paper we propose a new distance function (rank distance) designed to reflect stylistic similarity between texts. To assess the ability of this distance measure to capture ...
Marius Popescu, Liviu Petrisor Dinu
EACL
2003
ACL Anthology
13 years 11 months ago
Combining Distributional and Morphological Information for Part of Speech Induction
In this paper we discuss algorithms for clustering words into classes from unlabelled text using unsupervised algorithms, based on distributional and morphological information. We...
Alexander Clark
CPM
2006
Springer
140views Combinatorics» more  CPM 2006»
14 years 1 months ago
Identifying Co-referential Names Across Large Corpora
A single logical entity can be referred to by several different names over a large text corpus. We present our algorithm for finding all suchco-reference sets in a large corpus. Ou...
Levon Lloyd, Andrew Mehler, Steven Skiena
DMIN
2006
146views Data Mining» more  DMIN 2006»
13 years 11 months ago
A Comparison of Two Document Clustering Approaches for Clustering Medical Documents
Medical data is often presented as free text in the form of medical reports. Such documents contain important information about patients, disease progression and management, but ar...
Fathi H. Saad, Beatriz de la Iglesia, Duncan G. Be...