Sciweavers

6240 search results - page 49 / 1248
» From Internet Information Searching to Information Summarizi...
Sort
View
WWW
2005
ACM
14 years 9 months ago
Web data cleansing for information retrieval using key resource page selection
With the page explosion of WWW, how to cover more useful information with limited storage and computation resources becomes more and more important in web IR research. Using web p...
Yiqun Liu, Canhui Wang, Min Zhang, Shaoping Ma
WWW
2008
ACM
14 years 9 months ago
Generating hypotheses from the web
Hypothesis generation is a crucial initial step for making scientific discoveries. This paper addresses the problem of automatically discovering interesting hypotheses from the we...
Wei Jin, Rohini K. Srihari, Abhishek Singh
WWW
2004
ACM
14 years 9 months ago
Lessons from a Gnutella-web gateway
We present a gateway between the WWW and the Gnutella peer-topeer network that permits searchers on one side to be able to search and retrieve files on the other side of the gatew...
Brian D. Davison, Wei Zhang, Baoning Wu
WWW
2008
ACM
14 years 9 months ago
Recrawl scheduling based on information longevity
It is crucial for a web crawler to distinguish between ephemeral and persistent content. Ephemeral content (e.g., quote of the day) is usually not worth crawling, because by the t...
Christopher Olston, Sandeep Pandey
SIGIR
2000
ACM
14 years 1 months ago
Interactive Internet search: keyword, directory and query reformulation mechanisms compared
This article compares search effectiveness when using query-based Internet search (via the Google search engine), directory-based search (via Yahoo) and phrasebased query reformul...
Peter Bruza, Robert McArthur, Simon Dennis