Sciweavers

295 search results - page 36 / 59
» Web Crawling
Sort
View
WEBDB
2005
Springer
124views Database» more  WEBDB 2005»
14 years 3 months ago
JXP: Global Authority Scores in a P2P Network
This document presents the JXP algorithm for dynamically and collaboratively computing PageRank-style authority scores of Web pages distributed in a P2P network. In the architectu...
Josiane Xavier Parreira, Gerhard Weikum
PVLDB
2008
141views more  PVLDB 2008»
13 years 9 months ago
WebTables: exploring the power of tables on the web
The World-Wide Web consists of a huge number of unstructured documents, but it also contains structured data in the form of HTML tables. We extracted 14.1 billion HTML tables from...
Michael J. Cafarella, Alon Y. Halevy, Daisy Zhe Wa...
WISE
2005
Springer
14 years 3 months ago
Temporal Ranking of Search Engine Results
Existing search engines contain the picture of the Web from the past and their ranking algorithms are based on data crawled some time ago. However, a user requires not only relevan...
Adam Jatowt, Yukiko Kawai, Katsumi Tanaka
PREMI
2011
Springer
13 years 21 days ago
Finding Potential Seeds through Rank Aggregation of Web Searches
This paper presents a potential seed selection algorithm for web crawlers using a gain - share scoring approach. Initially we consider a set of arbitrarily chosen tourism queries. ...
Rajendra Prasath, Pinar Öztürk
CIKM
2009
Springer
14 years 4 months ago
Identifying comparable entities on the web
Web search engines are often presented with user queries that involve comparisons of real-world entities. Thus far, this interaction has typically been captured by users submittin...
Alpa Jain, Patrick Pantel