Search Sciweavers | Sciweavers

1133 search results - page 3 / 227

» Distributed community crawling

199

Voted

ADMA
2009
Springer

142views Data Mining» more ADMA 2009»

Crawling Deep Web Using a New Set Covering Algorithm

16 years 1 months ago

Download cs.uwindsor.ca

Abstract. Crawling the deep web often requires the selection of an appropriate set of queries so that they can cover most of the documents in the data source with low cost. This ca...

Yan Wang, Jianguo Lu, Jessica Chen

claim paper

Read More »

163

click to vote

CN
2000

75views more CN 2000»

Graph structure in the Web

15 years 6 months ago

Download www.cis.upenn.edu

The study of the web as a graph is not only fascinating in its own right, but also yields valuable insight into web algorithms for crawling, searching and community discovery, and...

Andrei Z. Broder, Ravi Kumar, Farzin Maghoul, Prab...

claim paper

Read More »

149

click to vote

JASIS
2008

86views more JASIS 2008»

Metadata harvesting for content-based distributed information retrieval

15 years 6 months ago

Download doc.rero.ch

We propose an approach to content-based Distributed Information Retrieval based on the periodic and incremental centralisation of full-content indices of widely dispersed and auto...

Fabio Simeoni, Murat Yakici, Steve Neely, Fabio Cr...

claim paper

Read More »

191

click to vote

PVLDB
2008

124views more PVLDB 2008»

Google's Deep Web crawl

15 years 6 months ago

Download www.cs.cornell.edu

The Deep Web, i.e., content hidden behind HTML forms, has long been acknowledged as a significant gap in search engine coverage. Since it represents a large portion of the structu...

Jayant Madhavan, David Ko, Lucja Kot, Vignesh Gana...

claim paper

Read More »

188

click to vote

SIGIR
2003
ACM

159views Information Technology» more SIGIR 2003»

Apoidea: A Decentralized Peer-to-Peer Architecture for Crawling the World Wide Web

15 years 12 months ago

Download www.aameeksingh.com

This paper describes a decentralized peer-to-peer model for building a Web crawler. Most of the current systems use a centralized client-server model, in which the crawl is done by...

Aameek Singh, Mudhakar Srivatsa, Ling Liu, Todd Mi...

claim paper

Read More »

« Prev « First page 3 / 227 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers