Search Sciweavers | Sciweavers

PVLDB
2010

Annotating and Searching Web Tables Using Entities, Types and Relationships

15 years 5 months ago

Tables are a universal idiom to present relational data. Billions of tables on Web pages express entity references, attributes and relationships. This representation of relational...

Girija Limaye, Sunita Sarawagi, Soumen Chakrabarti

CIKM
2009
Springer

Vetting the links of the web

16 years 1 months ago

Download www.cse.lehigh.edu

Many web links mislead human surfers and automated crawlers because they point to changed content, out-of-date information, or invalid URLs. It is a particular problem for large, ...

Na Dai, Brian D. Davison

CIKM
2009
Springer

Graph-based seed selection for web-scale crawlers

16 years 1 months ago

Download clgiles.ist.psu.edu

One of the most important steps in web crawling is determining the starting points, or seed selection. This paper identifies and explores the problem of seed selection in webscal...

Shuyi Zheng, Pavel Dmitriev, C. Lee Giles

SIGIR
2008
ACM

SpotSigs: robust and efficient near duplicate detection in large web collections

15 years 6 months ago

Download ilpubs.stanford.edu

Motivated by our work with political scientists who need to manually analyze large Web archives of news sites, we present SpotSigs, a new algorithm for extracting and matching sig...

Martin Theobald, Jonathan Siddharth, Andreas Paepc...

ISW
2009
Springer

Automated Spyware Collection and Analysis

16 years 1 months ago

Download www.cs.ucsb.edu

Various online studies on the prevalence of spyware attest overwhelming numbers (up to 80%) of infected home computers. However, the term spyware is ambiguous and can refer to anyt...

Andreas Stamminger, Christopher Kruegel, Giovanni ...
