Sciweavers

32 search results - page 1 / 7
SIGIR
2003
ACM
15 years 8 months ago
Building a filtering test collection for TREC 2002
Ian Soboroff, Stephen E. Robertson
NAACL
2003
15 years 4 months ago
Evaluating the Evaluation: A Case Study Using the TREC 2002 Question Answering Track
Evaluating competing technologies on a common problem set is a powerful way to improve the state of the art and hasten technology transfer. Yet poorly designed evaluations can was...
Ellen M. Voorhees
DELOS
2007
15 years 4 months ago
INEX 2002 - 2006: Understanding XML Retrieval Evaluation
Evaluating the effectiveness of XML retrieval requires building test collections where the evaluation paradigms are provided according to criteria that take into account structural...
Mounia Lalmas, Anastasios Tombros
SIGIR
2002
ACM
15 years 2 months ago
Liberal relevance criteria of TREC: counting on negligible documents?
Most test collections (like TREC and CLEF) for experimental research in information retrieval apply binary relevance assessments. This paper introduces a four-point relevance scal...
Eero Sormunen
SIGIR
2000
ACM
15 years 7 months ago
Building a question answering test collection
The TREC-8 Question Answering (QA) Track was the first large-scale evaluation of domain-independent question answering systems. In addition to fostering research on the QA task, ...
Ellen M. Voorhees, Dawn M. Tice