Sciweavers

611 search results - page 9 / 123
» Random web crawls
Sort
View
WWW
2002
ACM
14 years 8 months ago
Optimal crawling strategies for web search engines
Joel L. Wolf, Mark S. Squillante, Philip S. Yu, Ja...
DEBU
2002
135views more  DEBU 2002»
13 years 7 months ago
Analyzing Fine-grained Hypertext Features for Enhanced Crawling and Topic Distillation
Early Web search engines closely resembled Information Retrieval (IR) systems which had matured over several decades. Around 1996
Soumen Chakrabarti, Ravindra Jaju
ICML
2003
IEEE
14 years 8 months ago
Evolving Strategies for Focused Web Crawling
Judy Johnson, Kostas Tsioutsiouliklis, C. Lee Gile...
LPNMR
2001
Springer
14 years 4 days ago
Declarative Information Extraction, Web Crawling, and Recursive Wrapping with Lixto
Lixto is a system and method for the visual and interactive generation of wrappers for Web pages under the supervision of a human developer, for automatically extracting informatio...
Robert Baumgartner, Sergio Flesca, Georg Gottlob
ICDE
2007
IEEE
167views Database» more  ICDE 2007»
14 years 9 months ago
DSphere: A Source-Centric Approach to Crawling, Indexing and Searching the World Wide Web
We describe DSPHERE1 - a decentralized system for crawling, indexing, searching and ranking of documents in the World Wide Web. Unlike most of the existing search technologies tha...
Bhuvan Bamba, Ling Liu, James Caverlee, Vaibhav Pa...