Sciweavers

941 search results - page 142 / 189
» Keyword Search over Dynamic Categorized Information
Sort
View
AIRWEB
2009
Springer
14 years 2 months ago
A study of link farm distribution and evolution using a time series of web snapshots
In this paper, we study the overall link-based spam structure and its evolution which would be helpful for the development of robust analysis tools and research for Web spamming a...
Young-joo Chung, Masashi Toyoda, Masaru Kitsuregaw...
CIKM
2009
Springer
14 years 2 months ago
Graph-based seed selection for web-scale crawlers
One of the most important steps in web crawling is determining the starting points, or seed selection. This paper identifies and explores the problem of seed selection in webscal...
Shuyi Zheng, Pavel Dmitriev, C. Lee Giles
SIGIR
2006
ACM
14 years 1 months ago
A framework to predict the quality of answers with non-textual features
New types of document collections are being developed by various web services. The service providers keep track of non-textual features such as click counts. In this paper, we pre...
Jiwoon Jeon, W. Bruce Croft, Joon Ho Lee, Soyeon P...
HPDC
2010
IEEE
13 years 8 months ago
Efficient querying of distributed provenance stores
Current projects that automate the collection of provenance information use a centralized architecture for managing the resulting metadata - that is, provenance is gathered at rem...
Ashish Gehani, Minyoung Kim, Tanu Malik
WWW
2005
ACM
14 years 8 months ago
The volume and evolution of web page templates
Web pages contain a combination of unique content and template material, which is present across multiple pages and used primarily for formatting, navigation, and branding. We stu...
David Gibson, Kunal Punera, Andrew Tomkins