Sciweavers

450 search results - page 49 / 90
» Content Collection for the Labelling of Health-Related Web C...
Sort
View
WEBDB
2007
Springer
162views Database» more  WEBDB 2007»
14 years 2 months ago
Term Ranking for Clustering Web Search Results
Clustering web search engine results for ambiguous keyword searches poses unique challenges. First, we show that one cannot readily import the frequency based feature ranking to c...
Fatih Gelgi, Hasan Davulcu, Srinivas Vadrevu
CW
2003
IEEE
14 years 2 months ago
Webspace Surfing Patterns and Their Impact on Web Prefetching
The paper presents an interesting study that how the user surfing behavior with respect to the organization of a web space affects the performance of a prefetch enabled proxy. We ...
Javed I. Khan, Qingping Tao
GISCIENCE
2008
Springer
121views GIS» more  GISCIENCE 2008»
13 years 9 months ago
Identifying Maps on the World Wide Web
Abstract. This paper presents an automatic approach to mining collections of maps from the Web. Our method harvests images from the Web and then classifies them as maps or non-map...
Matthew Michelson, Aman Goel, Craig A. Knoblock
ICDE
2008
IEEE
143views Database» more  ICDE 2008»
14 years 10 months ago
Efficient Discovery of Authoritative Resources
Abstract- Given a dynamic corpus whose content and attention are changing on a daily basis, is it possible to collect and maintain the high-quality resources with a minimal investm...
Ravi Kumar, Kevin Lang, Cameron Marlow, Andrew Tom...
COLING
2010
13 years 3 months ago
Large Scale Parallel Document Mining for Machine Translation
A distributed system is described that reliably mines parallel text from large corpora. The approach can be regarded as cross-language near-duplicate detection, enabled by an init...
Jakob Uszkoreit, Jay Ponte, Ashok C. Popat, Moshe ...