Sciweavers

SIGIR
2011
ACM
13 years 5 months ago
When documents are very long, BM25 fails!
We reveal that the Okapi BM25 retrieval function tends to overly penalize very long documents. To address this problem, we present a simple yet effective extension of BM25, namel...
Yuanhua Lv, ChengXiang Zhai
JASIS
2010
14 years 1 month ago
Linear time series models for term weighting in information retrieval
Common measures of term importance in information retrieval (IR) rely on counts of term frequency; rare terms receive higher weight in document ranking than common terms receive. ...
Miles Efron
ECIR
2010
Springer
14 years 4 months ago
Semantically Enhanced Term Frequency
In this paper, we complement the term frequency, which is used in many bag-of-words based information retrieval models, with information about the semantic relatedness of query and...
Christof Müller, Iryna Gurevych
SPIRE
2007
Springer
14 years 9 months ago
Extending Weighting Models with a Term Quality Measure
Weighting models use lexical statistics, such as term frequencies, to derive term weights, which are used to estimate the relevance of a document to a query. Apart from t...
Christina Lioma, Iadh Ounis
ISDA
2008
IEEE
14 years 9 months ago
Compute the Term Contributed Frequency
In this paper, we propose an algorithm and data structure for computing the term contributed frequency (tcf) for all N-grams in a text corpus. Although term frequency is one of th...
Cheng-Lung Sung, Hsu-Chun Yen, Wen-Lian Hsu