Search Sciweavers | Sciweavers

472 search results - page 7 / 95

» Crawling the Hidden Web

153

click to vote

ADMA
2009
Springer

142views Data Mining» more ADMA 2009»

Crawling Deep Web Using a New Set Covering Algorithm

15 years 10 months ago

Download cs.uwindsor.ca

Abstract. Crawling the deep web often requires the selection of an appropriate set of queries so that they can cover most of the documents in the data source with low cost. This ca...

Yan Wang, Jianguo Lu, Jessica Chen

claim paper

Read More »

105

click to vote

WWW
2004
ACM

106views Internet Technology» more WWW 2004»

Distributed community crawling

16 years 4 months ago

Download www.iw3c2.org

The massive distribution of the crawling task can lead to inefficient exploration of the same portion of the Web. We propose a technique to guide crawlers exploration based on the...

Fabrizio Costa, Paolo Frasconi

claim paper

Read More »

108

click to vote

WWW
2007
ACM

162views Internet Technology» more WWW 2007»

Detecting near-duplicates for web crawling

16 years 4 months ago

Download infolab.stanford.edu

Near-duplicate web documents are abundant. Two such documents differ from each other in a very small portion that displays advertisements, for example. Such differences are irrele...

Gurmeet Singh Manku, Arvind Jain, Anish Das Sarma

claim paper

Read More »

126

Voted

STOC
2002
ACM

95views Algorithms» more STOC 2002»

Crawling on web graphs

16 years 3 months ago

Download www.math.cmu.edu

Colin Cooper, Alan M. Frieze

claim paper

Read More »

140

click to vote

WWW
2005
ACM

151views Internet Technology» more WWW 2005»

User-centric Web crawling

16 years 4 months ago

Download www2005.org

Search engines are the primary gateways of information access on the Web today. Behind the scenes, search engines crawl the Web to populate a local indexed repository of Web pages...

Sandeep Pandey, Christopher Olston

claim paper

Read More »

« Prev « First page 7 / 95 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers