Sciweavers

472 search results - page 9 / 95
» Crawling the Hidden Web
DEBU
2002
13 years 7 months ago
Analyzing Fine-grained Hypertext Features for Enhanced Crawling and Topic Distillation
Early Web search engines closely resembled Information Retrieval (IR) systems which had matured over several decades. Around 1996
Soumen Chakrabarti, Ravindra Jaju
ICML
2003
IEEE
14 years 8 months ago
Evolving Strategies for Focused Web Crawling
Judy Johnson, Kostas Tsioutsiouliklis, C. Lee Gile...
LPNMR
2001
Springer
14 years 6 hours ago
Declarative Information Extraction, Web Crawling, and Recursive Wrapping with Lixto
Lixto is a system and method for the visual and interactive generation of wrappers for Web pages under the supervision of a human developer, for automatically extracting informatio...
Robert Baumgartner, Sergio Flesca, Georg Gottlob
ICDE
2007
IEEE
14 years 9 months ago
DSphere: A Source-Centric Approach to Crawling, Indexing and Searching the World Wide Web
We describe DSPHERE - a decentralized system for crawling, indexing, searching and ranking of documents in the World Wide Web. Unlike most of the existing search technologies tha...
Bhuvan Bamba, Ling Liu, James Caverlee, Vaibhav Pa...
ADC
2004
Springer
14 years 29 days ago
Performance and Cost Tradeoffs in Web Search.
Web search engines crawl the web to fetch the data that they index. In this paper we re-examine that need, and evaluate the network costs associated with data acquisition, and alt...
Nick Craswell, Francis Crimmins, David Hawking, Al...