Extending average precision to graded relevance judgments

Download kanoulas.staff.shef.ac.uk

Evaluation metrics play a critical role both in the context of comparative evaluation of the performance of retrieval systems and in the context of learning-to-rank (LTR) as objective functions to be optimized. Many different evaluation metrics have been proposed in the IR literature, with average precision (AP) being the dominant one due to a number of desirable properties it possesses. However, most of these measures, including average precision, do not incorporate graded relevance. In this work, we propose a new measure of retrieval effectiveness, the Graded Average Precision (GAP). GAP generalizes average precision to the case of multi-graded relevance and inherits all the desirable characteristics of AP: it has a nice probabilistic interpretation, it approximates the area under a graded precision-recall curve, and it can be justified in terms of a simple but moderately plausible user model. We then evaluate GAP in terms of its informativeness and discriminative power. Finally, we ...
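To make the limitation mentioned in the abstract concrete, here is a minimal Python sketch of standard binary average precision. It is not the GAP measure from the paper, whose definition is not given on this page. The function names `average_precision` and `binarize`, the grade scale, and the threshold parameter are all illustrative assumptions. The sketch shows that graded labels must first be collapsed to relevant/non-relevant before AP can be computed, so two rankings that differ only in where the highly relevant document sits can receive the same score.

```python
from typing import List, Optional, Sequence


def average_precision(relevance: Sequence[int],
                      total_relevant: Optional[int] = None) -> float:
    """Standard binary AP for one ranked list.

    relevance[i] is 1 if the document at rank i+1 is relevant, else 0.
    total_relevant defaults to the number of relevant documents in the list
    (an assumption: it ignores relevant documents that were never retrieved).
    """
    if total_relevant is None:
        total_relevant = sum(1 for r in relevance if r > 0)
    if total_relevant == 0:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, rel in enumerate(relevance, start=1):
        if rel > 0:
            hits += 1
            # Precision at this rank, accumulated only at relevant documents.
            precision_sum += hits / rank
    return precision_sum / total_relevant


def binarize(grades: Sequence[int], threshold: int = 1) -> List[int]:
    """Collapse graded labels to binary using a hypothetical grade threshold."""
    return [1 if g >= threshold else 0 for g in grades]


# Two rankings on an assumed 0-2 grade scale; they differ only in where the
# highly relevant (grade 2) document appears.
ranking_a = [2, 0, 1, 0]
ranking_b = [1, 0, 2, 0]

ap_a = average_precision(binarize(ranking_a, threshold=1))
ap_b = average_precision(binarize(ranking_b, threshold=1))
ap_a_strict = average_precision(binarize(ranking_a, threshold=2))
```

With a threshold of 1 the two rankings are indistinguishable to binary AP, and the score for `ranking_a` changes when the threshold moves to 2. That dependence on an arbitrary cutoff is the kind of gap a graded measure is meant to close.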

Stephen E. Robertson, Evangelos Kanoulas, Emine Yilmaz

Average Precision | Different Evaluation Metrics | Evaluation Metrics | Information Management | SIGIR 2010

» Inferring document relevance via average precision

» Several methods of ranking retrieval systems with partial relevance judgment

» Ranking Retrieval Systems with Partial Relevance Judgements

» Hierarchical clustering of a Finnish newspaper article collection with graded relevance as...

» Retrieval system evaluation automatic evaluation versus incomplete judgments

» A statistical method for system evaluation using incomplete judgments

» Minimal test collections for retrieval evaluation

» Learning Ranking vs Modeling Relevance

Post Info

Added	16 Aug 2010
Updated	16 Aug 2010
Type	Conference
Year	2010
Where	SIGIR
Authors	Stephen E. Robertson, Evangelos Kanoulas, Emine Yilmaz
