Sciweavers

108 search results - page 5 / 22
» sigir 2009
Sort
View
SIGIR
2009
ACM
14 years 5 months ago
A relevance-based topic model for news event tracking
Event tracking is the task of discovering temporal patterns of popular events from text streams. Existing approaches for event tracking have two limitations: scalability and inabi...
Viet Ha-Thuc, Yelena Mejova, Christopher Harris, P...
SIGIR
2009
ACM
14 years 5 months ago
Visualizing the problems with the INEX topics
Topics form a crucial component of a test collection. We show, through visualization, that the INEX 2008 topics have shortcomings, which questions their validity for evaluating XM...
Andrew Trotman, Maria del Rocio Gomez Crisostomo, ...
SIGIR
2009
ACM
14 years 5 months ago
Identifying the original contribution of a document via language modeling
Abstract. One major goal of text mining is to provide automatic methods to help humans grasp the key ideas in ever-increasing text corpora. To this effect, we propose a statistica...
Benyah Shaparenko, Thorsten Joachims
SIGIR
2009
ACM
14 years 5 months ago
Brute force and indexed approaches to pairwise document similarity comparisons with MapReduce
This paper explores the problem of computing pairwise similarity on document collections, focusing on the application of “more like this” queries in the life sciences domain. ...
Jimmy J. Lin
SIGIR
2009
ACM
14 years 5 months ago
Measuring the descriptiveness of web comments
This paper investigates whether Web comments are of descriptive nature, that is, whether the combined text of a set of comments is similar in topic to the commented object. If so,...
Martin Potthast