Sciweavers

299 search results - page 11 / 60
» User-centric Web crawling
ICDE
2006
IEEE
14 years 1 month ago
Finding Thai Web Pages in Foreign Web Spaces
While the Web has been increasingly recognized as a culturally valuable social artifact, many nations endeavor to create national Web archives for long term preservation. However, ...
Kulwadee Somboonviwat, Takayuki Tamura, Masaru Kit...
WIDM
2004
ACM
14 years 26 days ago
Probabilistic models for focused web crawling
A focused crawler must use information gleaned from previously crawled page sequences to estimate the relevance of a newly seen URL. Therefore, good performance depends on powerfu...
Hongyu Liu, Evangelos E. Milios, Jeannette Janssen
WEBI
2010
Springer
13 years 5 months ago
Research Interests: Their Dynamics, Structures and Applications in Web Search Refinement
For most scientists, their research interests are dynamically changing all the time. Through an analysis of research interests, we find that all the changes are with some character...
Yi Zeng, Erzhong Zhou, Yulin Qin, Ning Zhong
WWW
2001
ACM
14 years 8 months ago
Crawling the Hidden Web
Current-day crawlers retrieve content only from the publicly indexable Web, i.e., the set of Web pages reachable purely by following hypertext links, ignoring search forms and pag...
Sriram Raghavan, Hector Garcia-Molina
SIGIR
2008
ACM
13 years 7 months ago
Compressed collections for simulated crawling
Collections are a fundamental tool for reproducible evaluation of information retrieval techniques. We describe a new method for distributing the document lengths and term counts ...
Alessio Orlandi, Sebastiano Vigna