Sciweavers

7 search results - page 1 / 2
» Relevance judgments between TREC and Non-TREC assessors
Sort
View
SIGIR
2008
ACM
13 years 11 months ago
Relevance judgments between TREC and Non-TREC assessors
This paper investigates the agreement of relevance assessments between official TREC judgments and those generated from an interactive IR experiment. Results show that 63% of docu...
Azzah Al-Maskari, Mark Sanderson, Paul Clough
SIGIR
2012
ACM
12 years 1 months ago
Effect of written instructions on assessor agreement
Assessors frequently disagree on the topical relevance of documents. How much of this disagreement is due to ambiguity in assessment instructions? We have two assessors assess TRE...
William Webber, Bryan Toth, Marjorie Desamito
SIGIR
2008
ACM
13 years 11 months ago
Evaluation over thousands of queries
Information retrieval evaluation has typically been performed over several dozen queries, each judged to near-completeness. There has been a great deal of recent work on evaluatio...
Ben Carterette, Virgiliu Pavlu, Evangelos Kanoulas...
SIGIR
2008
ACM
13 years 11 months ago
Evaluation measures for preference judgments
There has been recent interest in collecting user or assessor preferences, rather than absolute judgments of relevance, for the evaluation or learning of ranking algorithms. Since...
Ben Carterette, Paul N. Bennett
CIKM
2007
Springer
14 years 3 months ago
Hypothesis testing with incomplete relevance judgments
Information retrieval experimentation generally proceeds in a cycle of development, evaluation, and hypothesis testing. Ideally, the evaluation and testing phases should be short ...
Ben Carterette, Mark D. Smucker