Sciweavers

229 search results - page 5 / 46
» Evaluation measures for preference judgments
Sort
View
SIGIR
2010
ACM
14 years 8 days ago
Do user preferences and evaluation measures line up?
Mark Sanderson, Monica Lestari Paramita, Paul Clou...
ACL
2012
11 years 10 months ago
PORT: a Precision-Order-Recall MT Evaluation Metric for Tuning
Many machine translation (MT) evaluation metrics have been shown to correlate better with human judgment than BLEU. In principle, tuning on these metrics should yield better syste...
Boxing Chen, Roland Kuhn, Samuel Larkin
PKDD
2010
Springer
183views Data Mining» more  PKDD 2010»
13 years 6 months ago
Fast Active Exploration for Link-Based Preference Learning Using Gaussian Processes
Abstract. In preference learning, the algorithm observes pairwise relative judgments (preference) between items as training data for learning an ordering of all items. This is an i...
Zhao Xu, Kristian Kersting, Thorsten Joachims
ACL
2008
13 years 10 months ago
Assessing Dialog System User Simulation Evaluation Measures Using Human Judges
Previous studies evaluate simulated dialog corpora using evaluation measures which can be automatically extracted from the dialog systems' logs. However, the validity of thes...
Hua Ai, Diane J. Litman
CIKM
2007
Springer
14 years 2 months ago
Semiautomatic evaluation of retrieval systems using document similarities
Taking advantage of the well-known cluster hypothesis that “closely associated documents tend to be relevant to the same request”, we can use inter-document similarity to prov...
Ben Carterette, James Allan