In this paper we study how we can design an effective parallel crawler. As the size of the Web grows, it becomes imperative to parallelize the crawling process in order to finish d...
Previous work on domain-specific search services in the area of depressive illness has documented the significant human cost required to set up and maintain closed-crawl parameters...
Thanh Tin Tang, David Hawking, Nick Craswell, Rame...
Text is ubiquitous and, not surprisingly, many important applications rely on textual data for a variety of tasks. As a notable example, information extraction applications derive...
Panagiotis G. Ipeirotis, Eugene Agichtein, Pranay ...
This paper presents a software system that is able to generate crosswords with no human intervention, including definition generation and crossword compilation. In particular, the ...
Leonardo Rigutini, Michelangelo Diligenti, Marco M...
This paper shares our experience in designing a web crawler that can download billions of pages using a single-server implementation and models its performance. We show that with ...
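The entries above all concern web crawling at scale. As a purely illustrative sketch (not taken from any of the papers listed), the core frontier loop shared by most crawler designs can be shown over a small in-memory link graph; the `LINKS` graph, the `crawl` function, and the use of a dictionary in place of real HTTP fetching are all assumptions made here for self-containment:

```python
from collections import deque

# Hypothetical in-memory "web": maps a URL to the URLs it links to.
# Stands in for real HTTP fetching so the sketch is self-contained.
LINKS = {
    "a": ["b", "c"],
    "b": ["c", "d"],
    "c": ["a"],
    "d": [],
}

def crawl(seed, links, max_pages=10):
    """Breadth-first frontier loop common to most crawlers:
    pop a URL, 'fetch' it, enqueue its unseen out-links."""
    frontier = deque([seed])
    seen = {seed}
    order = []
    while frontier and len(order) < max_pages:
        url = frontier.popleft()
        order.append(url)               # "download" the page
        for out in links.get(url, []):  # out-links extracted from the page
            if out not in seen:         # de-duplication check
                seen.add(out)
                frontier.append(out)
    return order

print(crawl("a", LINKS))  # → ['a', 'b', 'c', 'd']
```

Parallel and distributed designs like those surveyed above differ mainly in how this frontier and the `seen` set are partitioned across workers or machines, not in the loop itself.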