Search Sciweavers | Sciweavers

SIGMOD
2010
ACM

Database

MapDupReducer: detecting near duplicates over massive datasets

Download www.cse.unsw.edu.au

Chaokun Wang, Jianmin Wang, Xuemin Lin, Wei Wang, ...

WWW
2008
ACM

Internet Technology

Efficient similarity joins for near duplicate detection

Download www2008.org

With the increasing amount of data and the need to integrate data from multiple data sources, a challenging issue is to find near duplicate records efficiently. In this paper, we ...

Chuan Xiao, Wei Wang 0011, Xuemin Lin, Jeffrey Xu ...

KDD
2004
ACM

Data Mining

Improved robustness of signature-based near-replica detection via lexicon randomization

Download ir.iit.edu

Detection of near duplicate documents is an important problem in many data mining and information filtering applications. When faced with massive quantities of data, traditional d...

Aleksander Kolcz, Abdur Chowdhury, Joshua Alspecto...
