Search Sciweavers | Sciweavers

140

Voted

WWW
2002
ACM

107views Internet Technology» more WWW 2002»

16 years 7 months ago

In this paper we study how we can design an effective parallel crawler. As the size of the Web grows, it becomes imperative to parallelize a crawling process, in order to finish d...

Junghoo Cho, Hector Garcia-Molina

claim paper

Read More »

148

click to vote

WWW
2007
ACM

198views Internet Technology» more WWW 2007»

Parallel crawling for online social networks

16 years 7 months ago

Download www2007.org

Given a huge online social network, how do we retrieve information from it through crawling? Even better, how do we improve the crawling performance by using parallel crawlers tha...

Duen Horng Chau, Shashank Pandit, Samuel Wang, Chr...

claim paper

Read More »

124

click to vote

ICETE
2007

91views Business» more ICETE 2007»

Scrawler: A Seed-By-Seed Parallel Web Crawler

15 years 8 months ago

Download dblab.ssu.ac.kr

Joo Yong Lee, Sang Ho Lee, Yanggon Kim

claim paper

Read More »

164

click to vote

GPC
2010
Springer

148views Distributed And Parallel Com...» more GPC 2010»

A Focused Crawler with Ontology-Supported Website Models for Information Agents

15 years 8 months ago

Download mail.sju.edu.tw

This paper advocated the use of ontology-supported website models to provide a semantic level solution for an information agent so that it can provide fast, precise, and stable que...

Sheng-Yuan Yang

claim paper

Read More »

157

Voted

PDP
2008
IEEE

83views Distributed And Parallel Com...» more PDP 2008»

Bulk-Synchronous On-Line Crawling on Clusters of Computers

16 years 1 months ago

Download research.yahoo.com

This paper describes the design of a crawler devised to perform the periodic retrieval of Web documents for a search engine able to accept on-line updates in a concurrent manner. ...

Mauricio Marín, Carolina Bonacic

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers