Sciweavers

3251 search results - page 534 / 651
» Challenges in Web Information Retrieval
Sort
View
SIGIR
2010
ACM
14 years 23 days ago
Adaptive near-duplicate detection via similarity learning
In this paper, we present a novel near-duplicate document detection method that can easily be tuned for a particular domain. Our method represents each document as a real-valued s...
Hannaneh Hajishirzi, Wen-tau Yih, Aleksander Kolcz
AIRS
2004
Springer
14 years 19 days ago
Effective Topic Distillation with Key Resource Pre-selection
Topic distillation aims at finding key resources which are high-quality pages for certain topics. With analysis in non-content features of key resources, a pre-selection method is ...
Yiqun Liu, Min Zhang, Shaoping Ma
CIKM
2006
Springer
14 years 18 days ago
A system for query-specific document summarization
There has been a great amount of work on query-independent summarization of documents. However, due to the success of Web search engines query-specific document summarization (que...
Ramakrishna Varadarajan, Vagelis Hristidis
PRICAI
2000
Springer
14 years 13 days ago
Towards a Next-Generation Search Engine
As more information becomes available on the World Wide Web, it has become an acute problem to provide effective search tools for information access. Previous generations of search...
Qiang Yang, Hai-Feng Wang, Ji-Rong Wen, Gao Zhang,...
CIKM
2008
Springer
13 years 11 months ago
Vanity fair: privacy in querylog bundles
A recently proposed approach to address privacy concerns in storing web search querylogs is bundling logs of multiple users together. In this work we investigate privacy leaks tha...
Rosie Jones, Ravi Kumar, Bo Pang, Andrew Tomkins