Sciweavers

72 search results - page 12 / 15
» Ontology-Focused Crawling of Web Documents
Sort
View
PAKDD
2009
ACM
116views Data Mining» more  PAKDD 2009»
14 years 2 months ago
Scalable Web Mining with Newistic
Abstract. Newistic is a web mining platform that collects and analyses documents crawled from the Internet. Although it currently processes news articles, it can be easily adapted ...
Ovidiu Dan, Horatiu Mocian
WISE
2005
Springer
14 years 1 months ago
Temporal Ranking of Search Engine Results
Existing search engines contain the picture of the Web from the past and their ranking algorithms are based on data crawled some time ago. However, a user requires not only relevan...
Adam Jatowt, Yukiko Kawai, Katsumi Tanaka
WEBDB
2005
Springer
124views Database» more  WEBDB 2005»
14 years 1 months ago
JXP: Global Authority Scores in a P2P Network
This document presents the JXP algorithm for dynamically and collaboratively computing PageRank-style authority scores of Web pages distributed in a P2P network. In the architectu...
Josiane Xavier Parreira, Gerhard Weikum
CIKM
2004
Springer
14 years 1 months ago
Node ranking in labeled directed graphs
Our work is motivated by the problem of ranking hyperlinked documents for a given query. Given an arbitrary directed graph with edge and node labels, we present a new flow-based ...
Krishna Prasad Chitrapura, Srinivas R. Kashyap
JCDL
2010
ACM
188views Education» more  JCDL 2010»
14 years 29 days ago
Exposing the hidden web for chemical digital libraries
In recent years, the vast amount of digitally available content has lead to the creation of many topic-centered digital libraries. Also in the domain of chemistry more and more di...
Sascha Tönnies, Benjamin Köhncke, Oliver...