Sciweavers

» Predicting Web Information Content
WWW
2008
ACM
14 years 9 months ago
Recrawl scheduling based on information longevity
It is crucial for a web crawler to distinguish between ephemeral and persistent content. Ephemeral content (e.g., quote of the day) is usually not worth crawling, because by the t...
Christopher Olston, Sandeep Pandey
SIGIR
2010
ACM
14 years 18 days ago
Predicting query performance on the web
Niranjan Balasubramanian, Giridhar Kumaran, Vitor ...
EDBT
2006
ACM
14 years 8 months ago
Indexing Shared Content in Information Retrieval Systems
Modern document collections often contain groups of documents with overlapping or shared content. However, most information retrieval systems process each document separa...
Andrei Z. Broder, Nadav Eiron, Marcus Fontoura, Mi...
ISI
2007
Springer
14 years 2 months ago
Terrorism and Crime Related Weblog Social Network: Link, Content Analysis and Information Visualization
A Weblog is a Web site where entries are made in diary style, maintained by its sole author (a blogger), and displayed in reverse chronological order. Due to the freedom and...
Christopher C. Yang, Tobun D. Ng
SEMWEB
2007
Springer
14 years 2 months ago
HealthFinland - Finnish Health Information on the Semantic Web
This paper shows how semantic web techniques can be applied to solving problems of distributed content creation, discovery, linking, aggregation, and reuse in health information po...
Eero Hyvönen, Kim Viljanen, Osma Suominen