If I Had a Million Queries

15 years 11 months ago

Download ir.cis.udel.edu

As document collections grow larger, the information needs and relevance judgments in a test collection must be well-chosen within a limited budget to give the most reliable and robust evaluation results. In this work we analyze a sample of queries categorized by length and corpus-appropriateness to determine the right proportion needed to distinguish between systems. We also analyze the appropriate division of labor between developing topics and making relevance judgments, and show that only a small, biased sample of queries with sparse judgments is needed to produce the same results as a much larger sample of queries.

Ben Carterette, Virgiliu Pavlu, Evangelos Kanoulas

Real-time Traffic

Computer Science | ECIR 2009 | Larger Sample | Relevance Judgments | Sparse Judgments |

claim paper

» Dismantling iClass and iClass Elite

» iSEGOPubmed a web interface for semantic enabled browsing of PubMed using Gene Ontology

» Green Query Optimization Taming Query Optimization Overheads through Plan Recycling

» Statistical Machine Translation for Query Expansion in Answer Retrieval

» Querylog mining for detecting spam

» Navigating largescale semistructured data in business portals

» Defeasible Logic

» Webcam Synopsis Peeking Around the World

Post Info
More Details (n/a)

Added	08 Mar 2010
Updated	08 Mar 2010
Type	Conference
Year	2009
Where	ECIR
Authors	Ben Carterette, Virgiliu Pavlu, Evangelos Kanoulas, Javed A. Aslam, James Allan

Comments (0)

Sciweavers

If I Had a Million Queries

Computer Science | ECIR 2009 | Larger Sample | Relevance Judgments | Sparse Judgments |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers