Search Sciweavers | Sciweavers


ADCS
2004

205 views, Applied Computing

Focused Crawling in Depression Portal Search: A Feasibility Study

15 years 8 months ago

Previous work on domain-specific search services in the area of depressive illness has documented the significant human cost required to set up and maintain closed-crawl parameters....

Thanh Tin Tang, David Hawking, Nick Craswell, Rame...


SIGMOD
2006
ACM

232 views, Database

To search or to crawl?: towards a query optimizer for text-centric tasks

16 years 7 months ago

Download pages.stern.nyu.edu

Text is ubiquitous and, not surprisingly, many important applications rely on textual data for a variety of tasks. As a notable example, information extraction applications derive...

Panagiotis G. Ipeirotis, Eugene Agichtein, Pranay ...


WWW
2002
ACM

107 views, Internet Technology

Parallel crawlers

16 years 7 months ago

Download oak.cs.ucla.edu

In this paper we study how we can design an effective parallel crawler. As the size of the Web grows, it becomes imperative to parallelize a crawling process, in order to finish d...

Junghoo Cho, Hector Garcia-Molina


WWW
2008
ACM

91 views, Internet Technology

IRLbot: scaling to 6 billion pages and beyond

16 years 7 months ago

Download irl.cs.tamu.edu

This paper shares our experience in designing a web crawler that can download billions of pages using a single-server implementation and models its performance. We show that with ...

Hsin-Tsang Lee, Derek Leonard, Xiaoming Wang, Dmit...


WWW
2008
ACM

124 views, Internet Technology

iRobot: an intelligent crawler for web forums

16 years 7 months ago

Download www2008.org

In this paper we study the Web forum crawling problem, which is a fundamental step in many Web applications, such as search engines and Web data mining. As a typical user-crea...

Rui Cai, Jiang-Ming Yang, Wei Lai, Yida Wang, Lei ...
