Search Sciweavers | Sciweavers

178 search results - page 21 / 36

» Scheduling Algorithms for Web Crawling

CIKM
2009
Springer

121 views · Information Technology

Graph-based seed selection for web-scale crawlers

16 years 1 month ago

Download clgiles.ist.psu.edu

One of the most important steps in web crawling is determining the starting points, or seed selection. This paper identifies and explores the problem of seed selection in webscal...

Shuyi Zheng, Pavel Dmitriev, C. Lee Giles

SIGIR
2005
ACM

150 views · Information Technology

Server selection methods in hybrid portal search

16 years 11 days ago

Download es.csiro.au

The TREC .GOV collection makes a valuable web testbed for distributed information retrieval methods because it is naturally partitioned and includes 725 web-oriented queries with ...

David Hawking, Paul Thomas

INFOCOM
2002
IEEE

136 views · Communications

Session-Based Overload Control in QoS-Aware Web Servers

15 years 11 months ago

Download www.cse.iitb.ac.in

With the explosive use of Internet, contemporary web servers are susceptible to overloads and their services deteriorate drastically and often cause denial of services. In this ...

Huamin Chen, Prasant Mohapatra

WWW
2001
ACM

150 views · Internet Technology

Effective Web data extraction with standard XML technologies

16 years 7 months ago

Download www10.org

We discuss the problem of Web data extraction and describe an XML-based methodology whose goal extends far beyond simple "screen scraping." An ideal data extraction proc...

Jussi Myllymaki

WWW
2005
ACM

195 views · Internet Technology

Three-level caching for efficient query processing in large Web search engines

16 years 7 months ago

Download cis.poly.edu

Large web search engines have to answer thousands of queries per second with interactive response times. Due to the sizes of the data sets involved, often in the range of multiple...

Xiaohui Long, Torsten Suel
