Sciweavers

164 search results - page 12 / 33
» Finding Related Pages Using the Link Structure of the WWW
Sort
View
WWW
2008
ACM
14 years 8 months ago
IRLbot: scaling to 6 billion pages and beyond
This paper shares our experience in designing a web crawler that can download billions of pages using a single-server implementation and models its performance. We show that with ...
Hsin-Tsang Lee, Derek Leonard, Xiaoming Wang, Dmit...
WWW
2008
ACM
14 years 8 months ago
Pagerank for product image search
In this paper, we cast the image-ranking problem into the task of identifying "authority" nodes on an inferred visual similarity graph and propose an algorithm to analyz...
Yushi Jing, Shumeet Baluja
WWW
2009
ACM
14 years 8 months ago
Analysis of community structure in Wikipedia
We present the results of a community detection analysis of the Wikipedia graph. Distinct communities in Wikipedia contain semantically closely related articles. The central topic...
Dmitry Lizorkin, Olena Medelyan, Maria P. Grineva
VLDB
2003
ACM
125views Database» more  VLDB 2003»
14 years 7 months ago
THESUS: Organizing Web document collections based on link semantics
Abstract. The requirements for effective search and management of the WWW are stronger than ever. Currently Web documents are classified based on their content not taking into acco...
Maria Halkidi, Benjamin Nguyen, Iraklis Varlamis, ...
PRICAI
2000
Springer
13 years 11 months ago
Extracting Logical Schema from the Web
One of the main limitations when accessing the web is the lack of explicit structure, whose presence may help in understanding data semantics. Schema for web data can be constructe...
Vincenza Carchiolo, Alessandro Longheu, Michele Ma...