Search Sciweavers | Sciweavers

178 search results - page 20 / 36

» Scheduling Algorithms for Web Crawling

ICIP
2000
IEEE

141 views, Image Processing

Efficient Video Similarity Measurement and Search

16 years 8 months ago

Download www.vis.uky.edu

We consider the use of meta-data and/or video-domain methods to detect similar videos on the web. Meta-data is extracted from the textual and hyperlink information associated with...

Sen-Ching S. Cheung, Avideh Zakhor

CIKM
2009
Springer

127 views, Information Technology

Vetting the links of the web

16 years 1 month ago

Download www.cse.lehigh.edu

Many web links mislead human surfers and automated crawlers because they point to changed content, out-of-date information, or invalid URLs. It is a particular problem for large, ...

Na Dai, Brian D. Davison

ICDM
2006
IEEE

164 views, Data Mining

Unsupervised Learning of Tree Alignment Models for Information Extraction

16 years 26 days ago

Download users.soe.ucsc.edu

We propose an algorithm for extracting fields from HTML search results. The output of the algorithm is a database table, a data structure that better lends itself to high-level...

Philip Zigoris, Damian Eads, Yi Zhang

WAW
2007
Springer

144 views, Algorithms

Approximating Betweenness Centrality

16 years 28 days ago

Download www.cc.gatech.edu

Betweenness is a centrality measure based on shortest paths, widely used in complex network analysis. It is computationally expensive to determine betweenness exactly; currently th...

David A. Bader, Shiva Kintali, Kamesh Madduri, Mil...

SIGIR
2008
ACM

176 views, Information Technology

SpotSigs: robust and efficient near duplicate detection in large web collections

15 years 6 months ago

Download ilpubs.stanford.edu

Motivated by our work with political scientists who need to manually analyze large Web archives of news sites, we present SpotSigs, a new algorithm for extracting and matching sig...

Martin Theobald, Jonathan Siddharth, Andreas Paepc...
