Search Sciweavers | Sciweavers

48 search results - page 1 / 10

» Collection statistics for fast duplicate document detection

135

Voted

TOIS
2002

51views more TOIS 2002»

Collection statistics for fast duplicate document detection

15 years 6 months ago

Download www.ir.iit.edu

Abdur Chowdhury, Ophir Frieder, David A. Grossman,...

claim paper

Read More »

170

click to vote

CIKM
2003
Springer

130views Information Technology» more CIKM 2003»

Online duplicate document detection: signature reliability in a dynamic retrieval environment

15 years 12 months ago

Download www.conradweb.org

As online document collections continue to expand, both on the Web and in proprietary environments, the need for duplicate detection becomes more critical. Few users wish to retri...

Jack G. Conrad, Xi S. Guo, Cindy P. Schriber

claim paper

Read More »

191

click to vote

LREC
2008

130views Education» more LREC 2008»

Detecting Co-Derivative Documents in Large Text Collections

15 years 8 months ago

Download www.lrec-conf.org

We have analyzed the SPEX algorithm by Bernstein and Zobel (2004) for detecting co-derivative documents using duplicate n-grams. Although we totally agree with the claim that not ...

Jan Pomikálek, Pavel Rychlý

claim paper

Read More »

159

click to vote

ADC
2007
Springer

108views Database» more ADC 2007»

Distributed Text Retrieval From Overlapping Collections

16 years 24 days ago

Download crpit.com

In standard text retrieval systems, the documents are gathered and indexed on a single server. In distributed information retrieval (DIR), the documents are held in multiple colle...

Milad Shokouhi, Justin Zobel, Yaniv Bernstein

claim paper

Read More »

178

click to vote

SIGIR
2004
ACM

136views Information Technology» more SIGIR 2004»

Constructing a text corpus for inexact duplicate detection

16 years 1 days ago

Download www.conradweb.org

As online document collections continue to expand, both on the Web and in proprietary environments, the need for duplicate detection becomes more critical. The goal of this work i...

Jack G. Conrad, Cindy P. Schriber

claim paper

Read More »

« Prev « First page 1 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers