Sciweavers

COOPIS
2004
IEEE
Minimizing the Network Distance in Distributed Web Crawling
Distributed crawling has shown that it can overcome important limitations of the centralized crawling paradigm. However, the distributed nature of current distributed cra...
Odysseas Papapetrou, George Samaras
ADBIS
2003
Springer
UCYMICRA: Distributed Indexing of the Web Using Migrating Crawlers
Due to the tremendous increase rate and the high change frequency of Web documents, maintaining an up-to-date index for searching purposes (search engines) is becoming a challenge....
Odysseas Papapetrou, Stavros Papastavrou, George S...
WWW
2003
ACM
Distributed Indexing of the Web Using Migrating Crawlers
Due to the tremendous increase rate and the high change frequency of Web documents, maintaining an up-to-date index for searching purposes (search engines) is becoming a challenge...
Odysseas Papapetrou, Stavros Papastavrou, George S...
WWW
2007
ACM
Detecting near-duplicates for web crawling
Near-duplicate web documents are abundant. Two such documents differ from each other in a very small portion that displays advertisements, for example. Such differences are irrele...
Gurmeet Singh Manku, Arvind Jain, Anish Das Sarma
IPM
2008
DistanceRank: An intelligent ranking algorithm for web pages
A fast and efficient page ranking mechanism for web crawling and retrieval remains a challenging issue. Recently, several link-based ranking algorithms like PageRank, HITS and ...
Ali Mohammad Zareh Bidoki, Nasser Yazdani