Sciweavers

22 search results - page 3 / 5
» Efficient URL caching for world wide web crawling
STACS
2009
Springer
14 years 5 months ago
A Comparison of Techniques for Sampling Web Pages
As the World Wide Web is growing rapidly, it is getting increasingly challenging to gather representative information about it. Instead of crawling the web exhaustively one has to...
Eda Baykan, Monika Rauch Henzinger, Stefan F. Kell...
GCC
2005
Springer
14 years 4 months ago
Coordinated Placement and Replacement for Grid-Based Hierarchical Web Caches
Web caching has been well accepted as a viable method for saving network bandwidth and reducing user access latency. To provide cache sharing on a large scale, hierarchical web cac...
Wenzhong Li, Kun Wu, Xu Ping, Ye Tao, Sanglu Lu, D...
KYOTODL
2000
14 years 5 days ago
Functions of a Web Warehouse
This paper proposes a web warehouse based approach to facilitating efficiency improvement, information sharing and service personalization for the World Wide Web. We will overview...
Kai Cheng, Yahiko Kambayashi, Seok Tae Lee, Mukesh...
CN
1999
13 years 10 months ago
The Gecko NFS Web Proxy
The World-Wide Web provides remote access to pages using its own naming scheme (URLs), transfer protocol (HTTP), and cache algorithms. Not only does using these special-purpose me...
Scott M. Baker, John H. Hartman
WWW
2007
ACM
14 years 11 months ago
Efficient Update of Indexes for Dynamically Changing Web Documents
Recent work on incremental crawling has enabled the indexed document collection of a search engine to be more synchronized with the changing World Wide Web. However, this synchron...
Lipyeow Lim, Min Wang, Sriram Padmanabhan, Jeffrey...