Sciweavers
Explore
Publications
Books
Software
Tutorials
Presentations
Lectures Notes
Datasets
Labs
Conferences
Community
Upcoming
Conferences
Top Ranked Papers
Most Viewed Conferences
Conferences by Acronym
Conferences by Subject
Conferences by Year
Tools
Sci2ools
International Keyboard
Graphical Social Symbols
CSS3 Style Generator
OCR
Web Page to Image
Web Page to PDF
Merge PDF
Split PDF
Latex Equation Editor
Extract Images from PDF
Convert JPEG to PS
Convert Latex to Word
Convert Word to PDF
Image Converter
PDF Converter
Community
Sciweavers
About
Terms of Use
Privacy Policy
Cookies
116
search results - page 24 / 24
»
Indexing and searching tera-scale Grid-Based Digital Librari...
Sort
relevance
views
votes
recent
update
View
thumb
title
41
click to vote
SIGIR
2008
ACM
176
views
Information Technology
»
more
SIGIR 2008
»
SpotSigs: robust and efficient near duplicate detection in large web collections
13 years 8 months ago
Download
ilpubs.stanford.edu
Motivated by our work with political scientists who need to manually analyze large Web archives of news sites, we present SpotSigs, a new algorithm for extracting and matching sig...
Martin Theobald, Jonathan Siddharth, Andreas Paepc...
claim paper
Read More »
« Prev
« First
page 24 / 24
Last »
Next »