While the Web makes an increasing number of ontologies widely available for applications, how to discover ontologies becomes a more challenging issue. Existing approaches are mainl...
Even prior to content, the genre of a web document leads to a first coarse binary classification of the recall space in relevant and non-relevant documents. Thinking of a genre se...
Andrea Stubbe, Christoph Ringlstetter, Randy Goebe...
We present S3 , a system that implicitly captures the process and products of Web investigations (exploratory searches involving multiple queries). This automatically-created, pers...
: We describe our participation in the TREC 2008 Enterprise track and detail our language modeling-based approaches. For document search, our focus was on query expansion using pro...
The rapid growth of the web has been noted and tracked extensively. Recent studies have however documented the dual phenomenon: web pages have small half lives, and thus the web e...
Ziv Bar-Yossef, Andrei Z. Broder, Ravi Kumar, Andr...