Bias and the limits of pooling

16 years 28 days ago

Download www.cs.umbc.edu

Modern retrieval test collections are built through a process called pooling in which only a sample of the entire document set is judged for each topic. The idea behind pooling is to ﬁnd enough relevant documents such that when unjudged documents are assumed to be nonrelevant the resulting judgment set is suﬃciently complete and unbiased. Yet a constant-size pool represents an increasingly small percentage of the document set as document sets grow larger, and at some point the assumption of approximately complete judgments must become invalid. This paper shows that the judgment sets produced by traditional pooling when the pools are too small relative to the total document set size can be biased in that they favor relevant documents that contain topic title words. This phenomenon is wholly dependent on the collection size and does not depend on the number of relevant documents for a given topic. We show that the AQUAINT test collection constructed in the recent TREC 2005 workshop ...

Chris Buckley, Darrin Dimmick, Ian Soboroff, Ellen

Real-time Traffic

Document Set | Relevant Documents | SIGIR 2006 | Test Collections |

claim paper

» poolHiTS A Shifted Transversal Design based pooling strategy for highthroughput drug scree...

» poolMC Smart pooling of mRNA samples in microarray experiments

» Limits of bias based assist methods in nanoscale 6T SRAM

» Elastic Rate Limiting for Spatially Biased Wireless Mesh Networks

» Stratification bias in low signal microarray studies

» Bias correction and Bayesian analysis of aggregate counts in SAGE libraries

» A twosample Bayesian ttest for microarray data

» The resource pooling principle

Post Info
More Details (n/a)

Added	14 Jun 2010
Updated	14 Jun 2010
Type	Conference
Year	2006
Where	SIGIR
Authors	Chris Buckley, Darrin Dimmick, Ian Soboroff, Ellen M. Voorhees

Comments (0)

Sciweavers

Bias and the limits of pooling

Document Set | Relevant Documents | SIGIR 2006 | Test Collections |

Explore & Download

Productivity Tools

Sciweavers