Search Sciweavers | Sciweavers

WWW
2004
ACM

179 views, Internet Technology

Combining link and content analysis to estimate semantic similarity

16 years 8 months ago

Search engines use content and link information to crawl, index, retrieve, and rank Web pages. The correlations between similarity measures based on these cues and on semantic ass...

Filippo Menczer

CIKM
2009
Springer

121 views, Information Technology

Graph-based seed selection for web-scale crawlers

16 years 2 months ago

One of the most important steps in web crawling is determining the starting points, or seed selection. This paper identifies and explores the problem of seed selection in webscal...

Shuyi Zheng, Pavel Dmitriev, C. Lee Giles

USS
2008

120 views, Operating System

There Is No Free Phish: An Analysis of "Free" and Live Phishing Kits

15 years 9 months ago

Phishing is a form of identity theft in which an attacker attempts to elicit confidential information from unsuspecting victims. While in the past there has been significant work ...

Marco Cova, Christopher Kruegel, Giovanni Vigna

ECIR
2006
Springer

134 views, Information Technology

Automatic Document Organization in a P2P Environment

15 years 9 months ago

Abstract. This paper describes an efficient method to construct reliable machine learning applications in peer-to-peer (P2P) networks by building ensemble based meta methods. We co...

Stefan Siersdorfer, Sergej Sizov

SIGIR
2010
ACM

173 views, Information Technology

The 8th workshop on large-scale distributed systems for information retrieval (LSDS-IR'10)

15 years 2 months ago

The size of the Web as well as user bases of search systems continue to grow exponentially. Consequently, providing subsecond query response times and high query throughput become...

Roi Blanco, Berkant Barla Cambazoglu, Claudio Lucc...
