Though attention to evaluating human-robot interfaces has increased in recent years, there are relatively few reports of using evaluation tools during the development of humanrobo...
We propose an automatic machine translation (MT) evaluation metric that calculates a similarity score (based on precision and recall) of a pair of sentences. Unlike most metrics, ...
In this paper we present Evaluate, a platform for learning performance monitoring. Evaluate manages a number of artefacts that can be used to monitor learning performance, like met...
Bernd Simon, Kasra Seirafi, Asmund Realfsen, Mark ...
This paper investigates a new evaluation method for assessing the coherence of computer-aided summaries, justified by the inappropriacy of existing evaluation methods for this tas...
In this paper we analyse data from the SemEval lexical substitution task in those cases where the annotators indicated that the target word was part of a phrase before substitutin...