Sciweavers

SIGIR
2005
ACM

Predicting query difficulty on the web by learning visual clues

We describe a method for predicting query difficulty in a precision-oriented web search task. Our approach uses visual features from retrieved surrogate document representations (titles, snippets, etc.) to predict retrieval effectiveness for a query. By training a supervised machine learning algorithm with manually evaluated queries, visual clues indicative of relevance are discovered. We show that this approach has a moderate correlation of 0.57 with precision at 10 scores from manual relevance judgments of the top ten documents retrieved by ten web search engines over 896 queries. Our findings indicate that difficulty predictors which have been successful in recall-oriented ad-hoc search, such as clarity metrics, are not nearly as correlated with engine performance in precision-oriented tasks such as this, yielding a maximum correlation of 0.3. Additionally, relying only on visual clues avoids the need for collection statistics that are required by these prior approaches. This enabl...
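The evaluation described above compares a per-query difficulty prediction against precision at 10 computed from manual relevance judgments. A minimal sketch of that comparison follows. It assumes the reported correlation is a Pearson coefficient, which the abstract does not state, and the helper names `precision_at_k` and `pearson` are hypothetical, not taken from the paper. It does not attempt to reproduce the visual-feature learner itself.

```python
from math import sqrt


def precision_at_k(judgments, k=10):
    """Fraction of the top-k results judged relevant.

    `judgments` is a list of booleans in rank order. If fewer than k
    results were retrieved, the missing ranks count as non-relevant.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    return sum(1 for judged_relevant in judgments[:k] if judged_relevant) / k


def pearson(xs, ys):
    """Pearson correlation between two equal-length numeric sequences.

    Assumption: the abstract reports only "correlation", so Pearson is
    used here purely for illustration.
    """
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


# Made-up toy data for three queries: relevance judgments per query and
# a hypothetical predictor's score for each query.
judged = [
    [True] * 8 + [False] * 2,
    [True] * 3 + [False] * 7,
    [False] * 10,
]
predicted = [0.9, 0.4, 0.1]
observed = [precision_at_k(j) for j in judged]
print(pearson(predicted, observed))
```

Dividing by k rather than by the number of results returned keeps scores comparable across queries when an engine returns fewer than ten results.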
Added 26 Jun 2010
Updated 26 Jun 2010
Type Conference
Year 2005
Where SIGIR
Authors Eric C. Jensen, Steven M. Beitzel, David A. Grossman, Ophir Frieder, Abdur Chowdhury