Evaluating Machine Translation Utility via Semantic Role Labels

15 years 8 months ago

Download www.lrec-conf.org

We present the methodology that underlies new metrics for semantic machine translation evaluation that we are developing. Unlike widely-used lexical and n-gram based MT evaluation metrics, the aim of semantic MT evaluation is to measure the utility of translations. We discuss the design of empirical studies to evaluate the utility of machine translation output by assessing the accuracy for key semantic roles. Such roles can be annotated using Propbank-style PRED and ARG labels. Recent work by Wu and Fung (2009) introduced methods based on automatic semantic role labeling into statistical machine translation, to enhance the quality of MT output. However, semantic SMT approaches have so far still only been evaluated using lexical and n-gram based SMT evaluation metrics such as BLEU, which are not aimed at evaluating the utility of MT output. Direct data analysis is still needed to understand how semantic models can be leveraged to evaluate the utility of MT output. In this paper, we dis...

Chi-kiu Lo, Dekai Wu

Real-time Traffic

Education | LREC 2010 | Machine Translation | Machine Translation Output | Semantic |

claim paper

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2010
Where	LREC
Authors	Chi-kiu Lo, Dekai Wu

Sciweavers

Evaluating Machine Translation Utility via Semantic Role Labels

Education | LREC 2010 | Machine Translation | Machine Translation Output | Semantic |

Explore & Download

Productivity Tools

Sciweavers