While test collection construction is a time-consuming and expensive process, the true cost is amortized by reusing the collection over hundreds or thousands of experiments. Some of these experiments may involve systems that retrieve documents not judged during the initial construction phase, and some of these systems may be "hard" to evaluate: depending on which judgments are missing and which judged documents were retrieved, the experimenter's confidence in an evaluation may be very low. We propose two methods for quantifying the reusability of a test collection for evaluating new systems. The proposed methods provide simple yet highly effective tests for determining whether an existing set of judgments is useful for evaluating a new system. Empirical evaluations using TREC datasets confirm the usefulness of our proposed reusability measures. In particular, we show that our methods can reliably estimate confidence intervals that are indicative of collection reusability.
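To make the confidence-interval idea concrete, the following is a minimal sketch, not the method proposed here: it scores a new system per topic with condensed-list average precision (unjudged documents are simply removed from the ranking before scoring) and then forms a percentile-bootstrap confidence interval over the per-topic scores. The function names (`condensed_ap`, `bootstrap_ci`) and the toy data are illustrative assumptions; a wide resulting interval corresponds to the low-confidence evaluations described above.

```python
import random
from statistics import mean

def condensed_ap(ranking, qrels):
    """Average precision over judged documents only.

    ranking: list of doc ids in retrieval order.
    qrels: dict mapping doc id -> 0/1 relevance; docs absent
    from qrels are unjudged and dropped before scoring.
    """
    judged = [d for d in ranking if d in qrels]
    num_rel = sum(qrels.values())
    hits, total = 0, 0.0
    for i, d in enumerate(judged, start=1):
        if qrels[d]:
            hits += 1
            total += hits / i
    return total / num_rel if num_rel else 0.0

def bootstrap_ci(per_topic_scores, trials=10000, alpha=0.05, seed=0):
    """Percentile-bootstrap confidence interval for the mean score.

    Resamples topics with replacement and takes the empirical
    alpha/2 and 1-alpha/2 quantiles of the resampled means.
    """
    rng = random.Random(seed)
    n = len(per_topic_scores)
    means = sorted(
        mean(rng.choices(per_topic_scores, k=n)) for _ in range(trials)
    )
    return means[int(alpha / 2 * trials)], means[int((1 - alpha / 2) * trials) - 1]

if __name__ == "__main__":
    # Toy example: three topics, each a (ranking, partial judgments) pair.
    topics = [
        (["d1", "d2", "d3", "d4"], {"d1": 1, "d3": 0, "d4": 1}),
        (["d5", "d6", "d7"],       {"d5": 0, "d7": 1}),
        (["d8", "d9"],             {"d8": 1, "d9": 1}),
    ]
    scores = [condensed_ap(r, q) for r, q in topics]
    lo, hi = bootstrap_ci(scores)
    print(f"mean AP = {mean(scores):.3f}, 95% CI = [{lo:.3f}, {hi:.3f}]")
```

The percentile bootstrap is used here only because it is a standard, assumption-light way to obtain an interval from per-topic scores; any interval estimator over topics could be substituted.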