Sciweavers

SIGIR
1999
ACM

Comparing the Performance of Database Selection Algorithms

14 years 4 months ago
Comparing the Performance of Database Selection Algorithms
Abstract We compare the performance of two database selection algorithms reported in the literature. Their performance is compared using a common testbed designed specifically for database selection techniques. The testbed is a decomposition of the TREC TIPSTER data into 236 subcollections. The databases from our testbed were ranked using both the gGlOSS and CORI techniques and compared to a baseline derived from TREC relevance judgements. We examined the degree to which CORI and gGlOSS approximate this baseline. Our results con rm our earlier observation that the gGlOSS Ideall ranks do not estimate relevancebased ranks well. We also ndthat CORI is a uniformly better estimator of relevance-based ranks than gGlOSS for the test environment used in this study. Part of the advantage of the CORI algorithm can be explained by a strong correlation between gGlOSS and a size-based baseline SBR. We also nd that CORI produces consistently accurate rankings on testbeds ranging from 100 921 sit...
James C. French, Allison L. Powell, James P. Calla
Added 03 Aug 2010
Updated 03 Aug 2010
Type Conference
Year 1999
Where SIGIR
Authors James C. French, Allison L. Powell, James P. Callan, Charles L. Viles, Travis Emmitt, Kevin J. Prey, Yun Mou
Comments (0)