Sciweavers

81 search results - page 6 / 17
» On the Evaluation of Document Analysis Components by Recall,...
Sort
View
CLEF
2011
Springer
12 years 7 months ago
Intrinsic Plagiarism Detection Using Character Trigram Distance Scores - Notebook for PAN at CLEF 2011
Abstract In this paper, we describe a novel approach to intrinsic plagiarism detection. Each suspicious document is divided into a series of consecutive, potentially overlapping ā€...
Mike Kestemont, Kim Luyckx, Walter Daelemans
COLING
2010
13 years 2 months ago
Topic-Based Bengali Opinion Summarization
In this paper the development of an opinion summarization system that works on Bengali News corpus has been described. The system identifies the sentiment information in each docu...
Amitava Das, Sivaji Bandyopadhyay
WWW
2003
ACM
14 years 8 months ago
Detecting Near-replicas on the Web by Content and Hyperlink Analysis
The presence of replicas or near-replicas of documents is very common on the Web. Documents may be replicated completely or partially for different reasons (versions, mirrors, etc...
Ernesto Di Iorio, Michelangelo Diligenti, Marco Go...
CIKM
2010
Springer
13 years 6 months ago
Ranking related entities: components and analyses
Related entity ļ¬nding is the task of returning a ranked list of homepages of relevant entities of a speciļ¬ed type that need to engage in a given relationship with a given sour...
Marc Bron, Krisztian Balog, Maarten de Rijke
ICDAR
2005
IEEE
14 years 1 months ago
An Approach towards Benchmarking of Table Structure Recognition Results
After developing a model free table recognition system we wanted to tune parameters in order to optimize the recognition performance. Therefore we developed a benchmarking environ...
Thomas Kieninger, Andreas Dengel