Sciweavers

SIGIR
2009
ACM

Compression-based document length prior for language models

14 years 7 months ago
Compression-based document length prior for language models
The inclusion of document length factors has been a major topic in the development of retrieval models. We believe that current models can be further improved by more refined estimations of the document’s scope. In this poster we present a new document length prior that uses the size of the compressed document. This new prior is introduced in the context of Language Modeling with Dirichlet smoothing. The evaluation performed on several collections shows significant improvements in effectiveness. Categories and Subject Descriptors: H.3.3 [Information Search and Retrieval]: Retrieval models General Terms: Performance, Experimentation.
Javier Parapar, David E. Losada, Alvaro Barreiro
Added 28 May 2010
Updated 28 May 2010
Type Conference
Year 2009
Where SIGIR
Authors Javier Parapar, David E. Losada, Alvaro Barreiro
Comments (0)