Retrieval system evaluation: automatic evaluation versus incomplete judgments

In information retrieval (IR), research aiming to reduce the cost of retrieval system evaluations has been conducted along two lines: (i) the evaluation of IR systems with reduced (i.e. incomplete) amounts of manual relevance assessments, and (ii) the fully automatic evaluation of IR systems, thus forgoing the need for manual assessments altogether. The methods proposed in both areas are commonly evaluated by comparing their performance estimates for a set of systems to a ground truth (provided, for instance, by evaluating the set of systems according to mean average precision). In contrast, in this poster we compare an automatic system evaluation approach directly to two evaluations based on incomplete manual relevance assessments. For the particular case of TREC's Million Query track, we show that the automatic evaluation leads to results that are highly correlated with those achieved by approaches relying on incomplete manual judgments. Categories and Subject Descriptors: H.3.3 ...
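
The comparison described in the abstract amounts to asking whether two evaluation methods order the same set of systems similarly. A minimal sketch of that idea in Python, assuming Kendall's tau as the rank correlation measure (the abstract does not name the measure used); the function name, system names, and scores are hypothetical and serve only as illustration:

```python
from itertools import combinations


def kendall_tau(scores_a, scores_b):
    """Kendall's tau-a between two {system: score} mappings.

    Assumption: both mappings cover the same systems. Tied pairs count
    as neither concordant nor discordant.
    """
    if set(scores_a) != set(scores_b):
        raise ValueError("both evaluations must cover the same systems")
    systems = sorted(scores_a)
    if len(systems) < 2:
        raise ValueError("need at least two systems to compare rankings")

    concordant = 0
    discordant = 0
    for s, t in combinations(systems, 2):
        # Positive product: both evaluations order the pair the same way.
        product = (scores_a[s] - scores_a[t]) * (scores_b[s] - scores_b[t])
        if product > 0:
            concordant += 1
        elif product < 0:
            discordant += 1

    n_pairs = len(systems) * (len(systems) - 1) // 2
    return (concordant - discordant) / n_pairs


# Hypothetical performance estimates for four systems (not from the paper).
automatic = {"sysA": 0.31, "sysB": 0.27, "sysC": 0.22, "sysD": 0.25}
incomplete = {"sysA": 0.40, "sysB": 0.35, "sysC": 0.30, "sysD": 0.28}

tau = kendall_tau(automatic, incomplete)
print(f"Kendall's tau = {tau:.3f}")
```

In this made-up example the two evaluations disagree only on the relative order of sysC and sysD, so five of the six system pairs are concordant. A value near 1 would indicate that the automatic evaluation ranks systems much like the evaluation based on incomplete judgments.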

Claudia Hauff, Franciska de Jong

Evaluation | Information Technology | IR Systems | Manual Relevance Assessments | SIGIR 2010 |

Post Info

Added	06 Dec 2010
Updated	06 Dec 2010
Type	Conference
Year	2010
Where	SIGIR
Authors	Claudia Hauff, Franciska de Jong
