Search Sciweavers | Sciweavers

563 search results - page 4 / 113

» Crawling the web for structured documents

260

Voted

CORR
2012
Springer

292views Education» more CORR 2012»

Optimal Threshold Control by the Robots of Web Search Engines with Obsolescence of Documents

14 years 2 months ago

Download www-sop.inria.fr

A typical web search engine consists of three principal parts: crawling engine, indexing engine, and searching engine. The present work aims to optimize the performance of the cra...

Konstantin Avrachenkov, Alexander N. Dudin, Valent...

claim paper

Read More »

211

Voted

CN
1999

242views more CN 1999»

Focused Crawling: A New Approach to Topic-Specific Web Resource Discovery

15 years 6 months ago

Download www.cse.iitb.ac.in

The rapid growth of the World-Wide Web poses unprecedented scaling challenges for general-purpose crawlers and search engines. In this paper we describe a new hypertext resource d...

Soumen Chakrabarti, Martin van den Berg, Byron Dom

claim paper

Read More »

164

click to vote

CN
2000

75views more CN 2000»

Graph structure in the Web

15 years 6 months ago

Download www.cis.upenn.edu

The study of the web as a graph is not only fascinating in its own right, but also yields valuable insight into web algorithms for crawling, searching and community discovery, and...

Andrei Z. Broder, Ravi Kumar, Farzin Maghoul, Prab...

claim paper

Read More »

194

click to vote

WWW
2009
ACM

153views Internet Technology» more WWW 2009»

Sitemaps: above and beyond the crawl of duty

16 years 7 months ago

Download www2009.eprints.org

Comprehensive coverage of the public web is crucial to web search engines. Search engines use crawlers to retrieve pages and then discover new ones by extracting the pages' o...

Uri Schonfeld, Narayanan Shivakumar

claim paper

Read More »

178

Voted

WEBDB
2005
Springer

129views Database» more WEBDB 2005»

Searching for Hidden-Web Databases

16 years 6 days ago

Download www.cs.utah.edu

Recently, there has been increased interest in the retrieval and integration of hidden Web data with a view to leverage high-quality information available in online databases. Alt...

Luciano Barbosa, Juliana Freire

claim paper

Read More »

« Prev « First page 4 / 113 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers