CL Research’s question-answering system for TREC 2003 was moved away from reliance on database technology toward a core underlying technology of massive XML-tagging for p...
This paper proposes a novel application of a statistical language model to opinionated document retrieval targeting weblogs (blogs). In particular, we explore the use of the trigg...
Text reuse occurs in many different types of documents and for many different reasons. One form of reuse, duplicate or near-duplicate documents, has been a focus of researchers be...
In this paper, a new novelty detection approach based on the identification of sentence-level information patterns is proposed. First, "novelty" is redefined ba...
Based on the important progress made in information retrieval (IR) in terms of theoretical models and evaluations, more and more attention has recently been paid to the research...