Sciweavers

472 search results - page 37 / 95
» Crawling the Hidden Web
Sort
View
SEMWEB
2007
Springer
14 years 1 months ago
Sindice.com: Weaving the Open Linked Data
Developers of Semantic Web applications face a challenge with respect to the decentralised publication model: where to find statements about encountered resources. The “linked d...
Giovanni Tummarello, Renaud Delbru, Eyal Oren
ISF
2011
13 years 2 months ago
A multi-region empirical study on the internet presence of global extremist organizations
Abstract Extremist organizations are heavily utilizing Internet technologies to increase their abilities to influence the world. Studying those global extremist organizations’ In...
Jialun Qin, Yilu Zhou, Hsinchun Chen
STACS
2009
Springer
14 years 2 months ago
A Comparison of Techniques for Sampling Web Pages
As the World Wide Web is growing rapidly, it is getting increasingly challenging to gather representative information about it. Instead of crawling the web exhaustively one has to...
Eda Baykan, Monika Rauch Henzinger, Stefan F. Kell...
CEAS
2007
Springer
14 years 1 months ago
Characterizing Web Spam Using Content and HTTP Session Analysis
Web spam research has been hampered by a lack of statistically significant collections. In this paper, we perform the first large-scale characterization of web spam using conten...
Steve Webb, James Caverlee, Calton Pu
EDBTW
2010
Springer
13 years 6 months ago
Using visual pages analysis for optimizing web archiving
Due to the growing importance of the World Wide Web, archiving it has become crucial for preserving useful source of information. To maintain a web archive up-to-date, crawlers ha...
Myriam Ben Saad, Stéphane Gançarski