Sciweavers

Web Crawling
IC
2004
13 years 10 months ago
IPMicra: An IP-address based Location Aware Distributed Web Crawler
Distributed crawling is able to overcome important limitations of the traditional single-sourced web crawling systems. However, the optimal benefit of distributed crawling is usual...
Odysseas Papapetrou, George Samaras
DMKD
2004
ACM
121 views, Data Mining
14 years 8 days ago
Discovery of ads web hosts through traffic data analysis
One of the most pressing current problems in web crawling...
V. Bacarella, Fosca Giannotti, Mirco Nanni, Dino P...
IC
2009
13 years 6 months ago
Language Based Crawling: Crawling the Arabic Content of the Web
Crawling web pages written in Arabic, or in any other language with limited content on the web, may at first seem to parallel the process of crawling English content. However, t...
Saad H. Alabbad, Sultan Alanazi
VLDB
2000
ACM
125 views, Database
14 years 3 days ago
Focused Crawling Using Context Graphs
Maintaining currency of search engine indices by exhaustive crawling is rapidly becoming impossible due to the increasing size and dynamic content of the web. Focused crawlers aim...
Michelangelo Diligenti, Frans Coetzee, Steve Lawre...
ICDM
2008
IEEE
186 views, Data Mining
14 years 3 months ago
xCrawl: A High-Recall Crawling Method for Web Mining
Web Mining Systems exploit the redundancy of data published on the Web to automatically extract information from existing web documents. The first step in the Information Extract...
Kostyantyn M. Shchekotykhin, Dietmar Jannach, Gerh...