Sciweavers

299 search results - page 25 / 60
» User-centric Web crawling
Sort
View
AINA
2008
IEEE
14 years 3 months ago
Structure of the Thai Web Graph
This paper presents structural properties of the Thai Web graph. We conduct an empirical study on the Web graphs induced from two Thai web snapshots crawled during January 2007 (5...
Kulwadee Somboonviwat, Shinji Suzuki, Masaru Kitsu...
WWW
2006
ACM
14 years 9 months ago
Status of the African Web
As part of the Language Observatory Project [4], we have been crawling all the web space since 2004. We have collected terabytes of data mostly from Asian and African ccTLDs. In t...
Rizza Camus Caminero, Pavol Zavarsky, Yoshiki Mika...
ASWC
2006
Springer
14 years 11 days ago
Next Generation Semantic Web Applications
Watson is a gateway to the Semantic Web: it collects, analyzes and gives access to ontologies and semantic data available online. Its objective is to support the development of ne...
Enrico Motta, Marta Sabou
NSDI
2010
13 years 10 months ago
The Architecture and Implementation of an Extensible Web Crawler
Many Web services operate their own Web crawlers to discover data of interest, despite the fact that largescale, timely crawling is complex, operationally intensive, and expensive...
Jonathan M. Hsieh, Steven D. Gribble, Henry M. Lev...
ICWE
2005
Springer
14 years 2 months ago
Identifying Websites with Flow Simulation
We present in this paper a method to discover the set of webpages contained in a logical website, based on the link structure of the Web graph. Such a method is useful in the conte...
Pierre Senellart