CIKM
1999
Springer

A General Language Model for Information Retrieval

Statistical language modeling has been successfully used for speech recognition, part-of-speech tagging, and syntactic parsing. Recently, it has also been applied to information retrieval. According to this new paradigm, each document is viewed as a language sample, and a query as a generation process. The retrieved documents are ranked based on the probabilities of producing a query from the corresponding language models of these documents. In this paper, we will present a new language model for information retrieval, which is based on a range of data smoothing techniques, including the Good-Turing estimate, curve-fitting functions, and model combinations. Our model is conceptually simple and intuitive, and can be easily extended to incorporate probabilities of phrases such as word pairs and word triples. The experiments with the Wall Street Journal and TREC4 data sets showed that the performance of our model is comparable to that of INQUERY and better than that of another language m...
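The ranking idea in the abstract (score each document by the probability that its language model generates the query) can be illustrated with a minimal sketch. This is not the model from the paper: it assumes a plain unigram model and swaps in simple linear interpolation between document and collection frequencies as one example of a "model combination", leaving out the Good-Turing estimate and curve fitting. The function names, the weight `lam`, and the toy documents are all hypothetical.

```python
import math
from collections import Counter


def query_log_likelihood(query, doc, corpus_counts, corpus_len, lam=0.5):
    """Return log P(query | doc) under a smoothed unigram model.

    Assumption: smoothing is a linear interpolation of the document
    model with the collection model, weighted by `lam`.
    """
    doc_counts = Counter(doc)
    doc_len = len(doc)
    score = 0.0
    for term in query:
        p_doc = doc_counts[term] / doc_len if doc_len else 0.0
        p_corpus = corpus_counts[term] / corpus_len
        p = lam * p_doc + (1.0 - lam) * p_corpus
        if p == 0.0:
            # Term unseen in the whole collection: the query cannot be generated.
            return float("-inf")
        score += math.log(p)
    return score


def rank(query, docs, lam=0.5):
    """Rank documents by descending query log-likelihood."""
    corpus_counts = Counter(t for tokens in docs.values() for t in tokens)
    corpus_len = sum(corpus_counts.values())
    scored = [
        (name, query_log_likelihood(query, tokens, corpus_counts, corpus_len, lam))
        for name, tokens in docs.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy collection (hypothetical data, for illustration only).
docs = {
    "d1": "stock market prices fell sharply".split(),
    "d2": "language model smoothing for retrieval".split(),
    "d3": "speech recognition uses a language model".split(),
}
ranking = rank("language model retrieval".split(), docs)
for name, score in ranking:
    print(name, round(score, 3))
```

Without smoothing, any document missing a single query term would get probability zero. The interpolation lets "d3" still receive a finite score even though it never mentions "retrieval".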
Fei Song, W. Bruce Croft
Added 03 Aug 2010
Updated 03 Aug 2010
Type Conference
Year 1999
Where CIKM