Sciweavers

178 search results - page 10 / 36
» Scheduling Algorithms for Web Crawling
Sort
View
WSDM
2009
ACM
176views Data Mining» more  WSDM 2009»
14 years 2 months ago
The web changes everything: understanding the dynamics of web content
The Web is a dynamic, ever changing collection of information. This paper explores changes in Web content by analyzing a crawl of 55,000 Web pages, selected to represent different...
Eytan Adar, Jaime Teevan, Susan T. Dumais, Jonatha...
STACS
2009
Springer
14 years 2 months ago
A Comparison of Techniques for Sampling Web Pages
As the World Wide Web is growing rapidly, it is getting increasingly challenging to gather representative information about it. Instead of crawling the web exhaustively one has to...
Eda Baykan, Monika Rauch Henzinger, Stefan F. Kell...
INFOCOM
2002
IEEE
14 years 14 days ago
Scheduling Algorithms for a Cache Pre-Filling Content Distribution Network
Abstract—Cache pre-filling is emerging as a new concept for increasing the availability of popular web items in cache servers. According to this concept, web items are sent by a...
Reuven Cohen, Liran Katzir, Danny Raz
AAMAS
2002
Springer
13 years 7 months ago
MySpiders: Evolve Your Own Intelligent Web Crawlers
The dynamic nature of the World Wide Web makes it a challenge to find information that is both relevant and recent. Intelligent agents can complement the power of search engines to...
Gautam Pant, Filippo Menczer
WWW
2010
ACM
13 years 11 months ago
Time is of the essence: improving recency ranking using Twitter data
Realtime web search refers to the retrieval of very fresh content which is in high demand. An effective portal web search engine must support a variety of search needs, including ...
Anlei Dong, Ruiqiang Zhang, Pranam Kolari, Jing Ba...