Sciweavers

57 search results - page 1 / 12
» Evaluation of Text Clustering Algorithms with N-Gram-Based D...
Sort
View
ECIR
2009
Springer
14 years 4 months ago
Evaluation of Text Clustering Algorithms with N-Gram-Based Document Fingerprints
This paper presents a new approach designed to reduce the computational load of the existing clustering algorithms by trimming down the documents size using fingerprinting methods...
Javier Parapar, Alvaro Barreiro
CIKM
2008
Springer
13 years 9 months ago
Winnowing-based text clustering
We present an approach to document clustering based on winnowing fingerprints that achieved good values of effectiveness with considerable save in memory space and computation tim...
Javier Parapar, Alvaro Barreiro
JCDL
2011
ACM
374views Education» more  JCDL 2011»
12 years 10 months ago
Comparative evaluation of text- and citation-based plagiarism detection approaches using guttenplag
Various approaches for plagiarism detection exist. All are based on more or less sophisticated text analysis methods such as string matching, fingerprinting or style comparison. I...
Bela Gipp, Norman Meuschke, Jöran Beel
CICLING
2008
Springer
13 years 9 months ago
Evaluation of Internal Validity Measures in Short-Text Corpora
Short texts clustering is one of the most difficult tasks in natural language processing due to the low frequencies of the document terms. We are interested in analysing these kind...
Diego Ingaramo, David Pinto, Paolo Rosso, Marcelo ...
CIKM
2006
Springer
13 years 11 months ago
Incremental hierarchical clustering of text documents
Incremental hierarchical text document clustering algorithms are important in organizing documents generated from streaming on-line sources, such as, Newswire and Blogs. However, ...
Nachiketa Sahoo, Jamie Callan, Ramayya Krishnan, G...