Sciweavers

SIGIR
1998
ACM

A Language Modeling Approach to Information Retrieval

14 years 4 months ago
A Language Modeling Approach to Information Retrieval
Abstract Models of document indexing and document retrieval have been extensively studied. The integration of these two classes of models has been the goal of several researchers but it is a very difficult problem. We argue that much of the reason for this is the lack of an adequate indexing model. This suggests that perhaps a better indexing model would help solve the problem. However, we feel that making unwarranted parametric assumptions will not lead to better retrieval performance. Furthermore, making prior assumptions about the similarity of documents is not warranted either. Instead, we propose an approach to retrieval based on probabilistic language modeling. We estimate models for each document individually. Our approach to modeling is non-parametric and integrates document indexing and document retrieval into a single model. One advantage of our approach is that collection statistics which are used heuristically in many other retrieval models are an integral part of our model...
Jay M. Ponte, W. Bruce Croft
Added 05 Aug 2010
Updated 05 Aug 2010
Type Conference
Year 1998
Where SIGIR
Authors Jay M. Ponte, W. Bruce Croft
Comments (0)