Sciweavers

229 search results - page 6 / 46
» Evaluation measures for preference judgments
Sort
View
AMTA
2004
Springer
14 years 4 days ago
The Significance of Recall in Automatic Metrics for MT Evaluation
Recent research has shown that a balanced harmonic mean (F1 measure) of unigram precision and recall outperforms the widely used BLEU and NIST metrics for Machine Translation evalu...
Alon Lavie, Kenji Sagae, Shyamsundar Jayaraman
SIGIR
2005
ACM
14 years 1 months ago
Accurately interpreting clickthrough data as implicit feedback
This paper examines the reliability of implicit feedback generated from clickthrough data in WWW search. Analyzing the users’ decision process using eyetracking and comparing im...
Thorsten Joachims, Laura A. Granka, Bing Pan, Hele...
CIKM
2011
Springer
12 years 8 months ago
A probabilistic method for inferring preferences from clicks
Evaluating rankers using implicit feedback, such as clicks on documents in a result list, is an increasingly popular alternative to traditional evaluation methods based on explici...
Katja Hofmann, Shimon Whiteson, Maarten de Rijke
ACL
2010
13 years 6 months ago
Evaluating Machine Translations Using mNCD
This paper introduces mNCD, a method for automatic evaluation of machine translations. The measure is based on normalized compression distance (NCD), a general information theoret...
Marcus Dobrinkat, Tero Tapiovaara, Jaakko Väy...
DAWAK
2008
Springer
13 years 10 months ago
The Evaluation of Sentence Similarity Measures
The ability to accurately judge the similarity between natural language sentences is critical to the performance of several applications such as text mining, question answering, an...
Palakorn Achananuparp, Xiaohua Hu, Xiajiong Shen