Sciweavers

611 search results - page 34 / 123
» Random web crawls
EDBTW
2010
Springer
13 years 8 months ago
Using visual pages analysis for optimizing web archiving
Due to the growing importance of the World Wide Web, archiving it has become crucial for preserving useful sources of information. To keep a web archive up to date, crawlers ha...
Myriam Ben Saad, Stéphane Gançarski
CHI
2009
ACM
14 years 10 months ago
Resonance on the web: web dynamics and revisitation patterns
The Web is a dynamic, ever-changing collection of information accessed in a dynamic way. This paper explores the relationship between Web page content change (obtained from an hou...
Eytan Adar, Jaime Teevan, Susan T. Dumais
ACL
2012
12 years 14 days ago
Learning to Find Translations and Transliterations on the Web
In this paper, we present a new method for learning to find translations and transliterations on the Web for a given term. The approach involves using a small set of terms and ...
Joseph Z. Chang, Jason S. Chang, Jyh-Shing Roger J...
WWW
2009
ACM
14 years 10 months ago
Detecting soft errors by redirection classification
A soft error redirection is a URL redirection to a page that returns the HTTP status code 200 (OK) but actually has no content relevant to the client request. Since such redirecti...
Taehyung Lee, Jinil Kim, Jin Wook Kim, Sung-Ryul K...
WEBDB
2005
Springer
14 years 3 months ago
Design and Implementation of a Geographic Search Engine
In this paper, we describe the design and initial implementation of a geographic search engine prototype for Germany, based on a large crawl of the .de domain. Geographic search en...
Alexander Markowetz, Yen-Yu Chen, Torsten Suel, Xi...