Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

171

Voted

SIGIR
2009
ACM

123views Information Technology» more SIGIR 2009»

Compression-based document length prior for language models

16 years 1 months ago

Compression-based document length prior for language models

Download www.dc.fi.udc.es

The inclusion of document length factors has been a major topic in the development of retrieval models. We believe that current models can be further improved by more reﬁned estimations of the document’s scope. In this poster we present a new document length prior that uses the size of the compressed document. This new prior is introduced in the context of Language Modeling with Dirichlet smoothing. The evaluation performed on several collections shows signiﬁcant improvements in eﬀectiveness. Categories and Subject Descriptors: H.3.3 [Information Search and Retrieval]: Retrieval models General Terms: Performance, Experimentation.

Javier Parapar, David E. Losada, Alvaro Barreiro

Real-time Traffic

Document Length | Document Length Factors | Information Retrieval | Retrieval Models | SIGIR 2009 |

claim paper

Related Content

» Probabilistic Document Length Priors for Language Models

» An analysis on document length retrieval trends in language modeling smoothing

» Language Models for Searching in Web Corpora

» Unsupervised Segmentation of Words Using Prior Distributions of Morph Length and Frequency

» A new robust relevance model in the language model framework

» Length normalization in XML retrieval

» Lossless Compression Based on the Sequence Memoizer

» Webcentric language models

» Language Models and Smoothing Methods for Collections with Large Variation in Document Len...

Post Info
More Details (n/a)

Added	28 May 2010
Updated	28 May 2010
Type	Conference
Year	2009
Where	SIGIR
Authors	Javier Parapar, David E. Losada, Alvaro Barreiro

Comments (0)