Sciweavers

326 search results - page 5 / 66
» Optimal crawling strategies for web search engines
Sort
View
DEBU
2002
135views more  DEBU 2002»
13 years 7 months ago
Analyzing Fine-grained Hypertext Features for Enhanced Crawling and Topic Distillation
Early Web search engines closely resembled Information Retrieval (IR) systems which had matured over several decades. Around 1996
Soumen Chakrabarti, Ravindra Jaju
ADC
2004
Springer
79views Database» more  ADC 2004»
14 years 24 days ago
Performance and Cost Tradeoffs in Web Search.
Web search engines crawl the web to fetch the data that they index. In this paper we re-examine that need, and evaluate the network costs associated with data acquisition, and alt...
Nick Craswell, Francis Crimmins, David Hawking, Al...
VISUAL
1999
Springer
13 years 11 months ago
Crawling for Images on the WWW
Search engines are useful because they allow the user to nd information of interest from the World-Wide Web. These engines use a crawler to gather information from Web sites. Howev...
Junghoo Cho, Sougata Mukherjea
INFOSCALE
2007
ACM
13 years 9 months ago
Mining query logs to optimize index partitioning in parallel web search engines
Large-scale Parallel Web Search Engines (WSEs) needs to adopt a strategy for partitioning the inverted index among a set of parallel server nodes. In this paper we are interested ...
Claudio Lucchese, Salvatore Orlando, Raffaele Pere...
SIGIR
2008
ACM
13 years 7 months ago
Exploring traversal strategy for web forum crawling
In this paper, we study the problem of Web forum crawling. Web forum has now become an important data source of many Web applications; while forum crawling is still a challenging ...
Yida Wang, Jiang-Ming Yang, Wei Lai, Rui Cai, Lei ...