Search Sciweavers | Sciweavers

SIGIR
2008
ACM

176 views · Information Technology

SpotSigs: robust and efficient near duplicate detection in large web collections

15 years 4 months ago

Motivated by our work with political scientists who need to manually analyze large Web archives of news sites, we present SpotSigs, a new algorithm for extracting and matching sig...

Martin Theobald, Jonathan Siddharth, Andreas Paepc...

ICIP
2000
IEEE

141 views · Image Processing

Efficient Video Similarity Measurement and Search

16 years 5 months ago

Download www.vis.uky.edu

We consider the use of meta-data and/or video-domain methods to detect similar videos on the web. Meta-data is extracted from the textual and hyperlink information associated with...

Sen-Ching S. Cheung, Avideh Zakhor

WWW
2004
ACM

179 views · Internet Technology

Combining link and content analysis to estimate semantic similarity

16 years 5 months ago

Download www.informatics.indiana.edu

Search engines use content and link information to crawl, index, retrieve, and rank Web pages. The correlations between similarity measures based on these cues and on semantic ass...

Filippo Menczer

USS
2008

120 views · Operating System

There Is No Free Phish: An Analysis of "Free" and Live Phishing Kits

15 years 6 months ago

Download www.cs.ucsb.edu

Phishing is a form of identity theft in which an attacker attempts to elicit confidential information from unsuspecting victims. While in the past there has been significant work ...

Marco Cova, Christopher Kruegel, Giovanni Vigna

ECIR
2006
Springer

134 views · Information Technology

Automatic Document Organization in a P2P Environment

15 years 5 months ago

Download ir.shef.ac.uk

This paper describes an efficient method to construct reliable machine learning applications in peer-to-peer (P2P) networks by building ensemble based meta methods. We co...

Stefan Siersdorfer, Sergej Sizov
