Sciweavers

180 search results - page 8 / 36
» A Method for Calculating Term Similarity on Large Document C...
Sort
View
CLEF
2010
Springer
13 years 8 months ago
A Textual-Based Similarity Approach for Efficient and Scalable External Plagiarism Analysis - Lab Report for PAN at CLEF 2010
In this paper we present an approach to detect external plagiarism based on textual similarity. This is an efficient and precise method that can be applied over large sets of docum...
Daniel Micol, Óscar Ferrández, Ferna...
SETN
2004
Springer
14 years 27 days ago
Clustering XML Documents by Structure
This work explores the application of clustering methods for grouping structurally similar XML documents. Modeling the XML documents as rooted ordered labeled trees, we apply clust...
Theodore Dalamagas, Tao Cheng, Klaas-Jan Winkel, T...
CORR
2006
Springer
132views Education» more  CORR 2006»
13 years 7 months ago
Navigating multilingual news collections using automatically extracted information
We are presenting a text analysis tool set that allows analysts in various fields to sieve through large collections of multilingual news items quickly and to find information that...
Ralf Steinberger, Bruno Pouliquen, Camelia Ignat
BMCBI
2007
163views more  BMCBI 2007»
13 years 7 months ago
A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluati
Background: A huge amount of biomedical textual information has been produced and collected in MEDLINE for decades. In order to easily utilize biomedical information in the free t...
Illhoi Yoo, Xiaohua Hu, Il-Yeol Song
SIGIR
2010
ACM
13 years 2 months ago
Efficient partial-duplicate detection based on sequence matching
With the ever-increasing growth of the Internet, numerous copies of documents become serious problem for search engine, opinion mining and many other web applications. Since parti...
Qi Zhang, Yue Zhang, Haomin Yu, Xuanjing Huang