focused crawling | Sciweavers

207

CN
1999

242views more CN 1999»

Focused Crawling: A New Approach to Topic-Specific Web Resource Discovery

15 years 6 months ago

The rapid growth of the World-Wide Web poses unprecedented scaling challenges for general-purpose crawlers and search engines. In this paper we describe a new hypertext resource d...

Soumen Chakrabarti, Martin van den Berg, Byron Dom

claim paper

Read More »

176

click to vote

ECAI
2008
Springer

127views Artificial Intelligence» more ECAI 2008»

Reinforcement Learning with Classifier Selection for Focused Crawling

15 years 8 months ago

Download users.auth.gr

Focused crawlers are programs that wander in the Web, using its graph structure, and gather pages that belong to a specific topic. The most critical task in Focused Crawling is the...

Ioannis Partalas, Georgios Paliouras, Ioannis P. V...

claim paper

Read More »

178

click to vote

VLDB
2000
ACM

125views Database» more VLDB 2000»

Focused Crawling Using Context Graphs

15 years 10 months ago

Download clgiles.ist.psu.edu

Maintaining currency of search engine indices by exhaustive crawling is rapidly becoming impossible due to the increasing size and dynamic content of the web. Focused crawlers aim...

Michelangelo Diligenti, Frans Coetzee, Steve Lawre...

claim paper

Read More »

174

click to vote

AIIA
2001
Springer

130views Artificial Intelligence» more AIIA 2001»

Evaluation Methods for Focused Crawling

15 years 11 months ago

Download www.dsi.unifi.it

The exponential growth of documents available in the World Wide Web makes it increasingly diﬃcult to discover relevant information on a speciﬁc topic. In this context, growing ...

Andrea Passerini, Paolo Frasconi, Giovanni Soda

claim paper

Read More »

164

click to vote

VLDB
2004
ACM

113views Database» more VLDB 2004»

Accurate and Efficient Crawling for Relevant Websites

15 years 12 months ago

Download www.vldb.org

Focused web crawlers have recently emerged as an alternative to the well-established web search engines. While the well-known focused crawlers retrieve relevant webpages, there ar...

Martin Ester, Hans-Peter Kriegel, Matthias Schuber...

claim paper

Read More »

206

click to vote

ICDM
2008
IEEE

186views Data Mining» more ICDM 2008»

xCrawl: A High-Recall Crawling Method for Web Mining

16 years 1 months ago

Download ls13-www.cs.uni-dortmund.de

Web Mining Systems exploit the redundancy of data published on the Web to automatically extract information from existing web documents. The ﬁrst step in the Information Extract...

Kostyantyn M. Shchekotykhin, Dietmar Jannach, Gerh...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers