On Improving Pseudo-Relevance Feedback Using Pseudo-Irrelevant Documents


Abstract. Pseudo-Relevance Feedback (PRF) assumes that the top-ranking n documents of the initial retrieval are relevant and extracts expansion terms from them. In this work, we introduce the notion of pseudo-irrelevant documents, i.e., high-scoring documents outside the top n that are highly unlikely to be relevant. We show how pseudo-irrelevant documents can be used to extract better expansion terms from the top-ranking n documents: good expansion terms are those which discriminate the top-ranking n documents from the pseudo-irrelevant documents. Our approach gives substantial improvements in retrieval performance over Model-based Feedback on several test collections.

Keywords: Information Retrieval, Pseudo-Relevance Feedback, Query Expansion, Pseudo-Irrelevance, Linear Classifier
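The idea of picking expansion terms that discriminate the top-ranking n documents from the pseudo-irrelevant ones can be illustrated with a small sketch. This is not the authors' method: the abstract does not say how pseudo-irrelevant documents are selected or which linear classifier is trained, so the sketch assumes the documents at ranks n+1 to n+m play the pseudo-irrelevant role and uses a simple centroid-difference weight per term as a stand-in for a learned linear model. The function names `term_weights` and `expansion_terms` and the parameters `m` and `k` are hypothetical.

```python
from collections import Counter


def term_weights(ranked_docs, n, m):
    """Weight each term by how strongly it separates the top-n documents
    from the next m documents in the ranking."""
    pseudo_relevant = ranked_docs[:n]
    # Assumption: ranks n+1..n+m stand in for the pseudo-irrelevant set.
    pseudo_irrelevant = ranked_docs[n:n + m]

    def centroid(docs):
        # Average of length-normalised term frequencies over the documents.
        total = Counter()
        for tokens in docs:
            length = len(tokens) or 1
            for term, count in Counter(tokens).items():
                total[term] += count / length / len(docs)
        return total

    rel = centroid(pseudo_relevant)
    irr = centroid(pseudo_irrelevant)
    vocabulary = set(rel) | set(irr)
    # Linear weight per term: positive means "more typical of the top n".
    return {t: rel.get(t, 0.0) - irr.get(t, 0.0) for t in vocabulary}


def expansion_terms(ranked_docs, query_terms, n=2, m=2, k=3):
    """Return up to k non-query terms with the largest positive weights."""
    weights = term_weights(ranked_docs, n, m)
    candidates = [
        (w, t) for t, w in weights.items()
        if w > 0 and t not in query_terms
    ]
    # Highest weight first; ties broken alphabetically for determinism.
    candidates.sort(key=lambda pair: (-pair[0], pair[1]))
    return [t for _, t in candidates[:k]]


# Toy ranking: each document is a list of tokens, best-scoring first.
ranked = [
    ["jaguar", "car", "engine", "speed"],
    ["jaguar", "car", "dealer", "engine"],
    ["jaguar", "cat", "jungle", "prey"],
    ["jaguar", "cat", "habitat", "jungle"],
]
print(expansion_terms(ranked, {"jaguar"}, n=2, m=2, k=2))
```

On this toy ranking the sketch selects "car" and "engine". The term "jaguar" occurs equally often in both sets and gets weight zero, while "cat" and "jungle" get negative weights, which is the discriminative behaviour the abstract describes.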

Karthik Raman, Raghavendra Udupa, Pushpak Bhattacharyya, Abhijit Bhole


ECIR 2010 | Expansion Terms | Information Technology | Pseudo-irrelevant Documents | Pseudo-relevance Feedback


Post Info

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2010
Where	ECIR
Authors	Karthik Raman, Raghavendra Udupa, Pushpak Bhattacharyya, Abhijit Bhole
