Sciweavers

472 search results - page 22 / 95
» Crawling the Hidden Web
ECIR
2006
Springer
13 years 9 months ago
Efficient Parallel Computation of PageRank
Abstract. PageRank is inherently massively parallelizable and distributable, as a result of the web's strict host-based link locality. In this paper we show that the Gau
Christian Kohlschütter, Paul-Alexandru Chirit...
ICMLA
2008
13 years 9 months ago
A Fully Automatic Crossword Generator
This paper presents a software system that is able to generate crosswords with no human intervention, including definition generation and crossword compilation. In particular, the ...
Leonardo Rigutini, Michelangelo Diligenti, Marco M...
SIGMOD
2001
ACM
14 years 7 months ago
Probe, Count, and Classify: Categorizing Hidden Web Databases
Panagiotis G. Ipeirotis, Luis Gravano, Mehran Saha...
DASFAA
2007
IEEE
14 years 1 months ago
Graph Structure of the Korea Web
The study of the Web graph not only yields valuable insight into Web algorithms for crawling, searching and community discovery, and the sociological phenomena that characterize it...
In Kyu Han, Sang Ho Lee, Soowon Lee
WWW
2009
ACM
14 years 8 months ago
The web of nations
In this paper, we report on a large-scale study of structural differences among national webs. The study is based on a web-scale crawl conducted in the summer of 2008. More specif...
Sukwon Chung, Dungjit Shiowattana, Pavel Dmitriev,...