Search Sciweavers | Sciweavers

30

PDP
2008
IEEE

83views Distributed And Parallel Com...» more PDP 2008»

Bulk-Synchronous On-Line Crawling on Clusters of Computers

14 years 5 months ago

This paper describes the design of a crawler devised to perform the periodic retrieval of Web documents for a search engine able to accept on-line updates in a concurrent manner. ...

Mauricio Marín, Carolina Bonacic

claim paper

Read More »

29

click to vote

PVLDB
2008

124views more PVLDB 2008»

Google's Deep Web crawl

13 years 10 months ago

Download www.cs.cornell.edu

The Deep Web, i.e., content hidden behind HTML forms, has long been acknowledged as a significant gap in search engine coverage. Since it represents a large portion of the structu...

Jayant Madhavan, David Ko, Lucja Kot, Vignesh Gana...

claim paper

Read More »

31

click to vote

VISUAL
1999
Springer

107views Information Technology» more VISUAL 1999»

Crawling for Images on the WWW

14 years 3 months ago

Download oak.cs.ucla.edu

Search engines are useful because they allow the user to nd information of interest from the World-Wide Web. These engines use a crawler to gather information from Web sites. Howev...

Junghoo Cho, Sougata Mukherjea

claim paper

Read More »

42

click to vote

CIKM
2010
Springer

166views Information Technology» more CIKM 2010»

Crawling the web for structured documents

13 years 7 months ago

Download www.mendeley.com

Structured Information Retrieval is gaining a lot of interest in recent years, as this kind of information is becoming an invaluable asset for professional communities such as Sof...

Julián Urbano, Juan Loréns, Yorgos A...

claim paper

Read More »

33

click to vote

WWW
2001
ACM

113views Internet Technology» more WWW 2001»

Crawling the Hidden Web

14 years 11 months ago

Download www.dia.uniroma3.it

Current-day crawlers retrieve content only from the publicly indexable Web, i.e., the set of Web pages reachable purely by following hypertext links, ignoring search forms and pag...

Sriram Raghavan, Hector Garcia-Molina

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers