Search engines provide search results based on a large repository of pages downloaded by a web crawler from several servers. To provide the best results, this repository must be kept ...

Web crawlers are increasingly used for focused tasks such as the extraction of data from Wikipedia or the analysis of social networks like last.fm. In these cases, pages are far m...
Franziska von dem Bussche, Klara A. Weiand, Benedi...

This paper presents an algorithm to bound the bandwidth used by a Web crawler. The crawler collects statistics on the transfer rate of each server to predict the expected bandwidth use...
Michelangelo Diligenti, Marco Maggini, Filippo Mar...
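
The excerpt above outlines the mechanism at a high level: per-server transfer-rate statistics are used to predict how much bandwidth a pending download will consume, so the crawler can keep its total usage under a cap. The sketch below illustrates that general idea in Python; it is not the paper's algorithm, and every name in it (BandwidthScheduler, max_bytes_per_sec, try_dispatch, the default rate estimate) is a hypothetical stand-in.

```python
from collections import defaultdict


class BandwidthScheduler:
    """Sketch of bandwidth-bounded fetch scheduling (assumptions as noted above)."""

    def __init__(self, max_bytes_per_sec, alpha=0.3):
        self.max_bytes_per_sec = max_bytes_per_sec  # global bandwidth cap
        self.alpha = alpha                          # EMA smoothing factor
        self.rate = defaultdict(lambda: 50_000.0)   # per-server rate estimate (B/s), arbitrary prior
        self.active = {}                            # url -> predicted rate of in-flight fetches

    def predicted_load(self):
        # Total bandwidth the in-flight downloads are expected to use.
        return sum(self.active.values())

    def try_dispatch(self, url, server):
        # Dispatch a fetch only if the predicted total stays under the cap.
        expected = self.rate[server]
        if self.predicted_load() + expected > self.max_bytes_per_sec:
            return False  # defer: the cap would be exceeded
        self.active[url] = expected
        return True

    def on_complete(self, url, server, nbytes, seconds):
        # Update the server's transfer-rate statistic from an observed fetch.
        del self.active[url]
        observed = nbytes / max(seconds, 1e-6)
        self.rate[server] = (1 - self.alpha) * self.rate[server] + self.alpha * observed
```

In a real crawler, deferred URLs would return to a per-server queue and be retried as in-flight fetches complete; the smoothing factor trades responsiveness to changing server speeds against stability of the prediction.
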
Broad web search engines as well as many more specialized search tools rely on web crawlers to acquire large collections of pages for indexing and analysis. Such a web crawler may...