We investigate the topical structure of the set of documents used to expand a query in pseudorelevance feedback (PRF). We propose a coherence score to measure the relative topical...
Desktop search is an important part of personal information management (PIM). However, research in this area has been limited by the lack of shareable test collections, making cum...
Large web search engines have to answer thousands of queries per second with interactive response times. Due to the sizes of the data sets involved, often in the range of multiple...
This paper presents a sentence extraction method based on Concept Coupling Model, a language model for handling natural language sentence structures. Sentence extraction is perfor...
We present a query-driven algorithm for the distributed indexing of large document collections within structured P2P networks. To cope with bandwidth consumption that has been ide...
Gleb Skobeltsyn, Toan Luu, Ivana Podnar Zarko, Mar...