Sciweavers

INEX
2007
Springer

INEX 2007 Evaluation Measures

14 years 6 months ago
INEX 2007 Evaluation Measures
Abstract. This paper describes the official measures of retrieval effectiveness that are employed for the Ad Hoc Track at INEX 2007. Whereas in earlier years all, but only, XML elements could be retrieved, the result format has been liberalized to arbitrary passages. In response, the INEX 2007 measures are based on the amount of highlighted text retrieved, leading to natural extensions of the well-established measures of precision and recall. The following measures are defined: The Focused Task is evaluated by interpolated precision at 1% recall (iP[0.01]) in terms of the highlighted text retrieved. The Relevant in Context Task is evaluated by mean average generalized precision (MAgP) where the generalized score per article is based on the retrieved highlighted text. The Best in Context Task is also evaluated by mean average generalized precision (MAgP) but here the generalized score per article is based on the distance to the assessor’s best-entry point.
Jaap Kamps, Jovan Pehcevski, Gabriella Kazai, Moun
Added 08 Jun 2010
Updated 08 Jun 2010
Type Conference
Year 2007
Where INEX
Authors Jaap Kamps, Jovan Pehcevski, Gabriella Kazai, Mounia Lalmas, Stephen Robertson
Comments (0)