Extending the BLEU MT Evaluation Method with Frequency Weightings

We present the results of an experiment on extending BLEU, the automatic method of Machine Translation evaluation, with statistical weights for lexical items, such as tf.idf scores. We show that this extension gives additional information about the evaluated texts; in particular, it allows us to measure translation Adequacy, which, for statistical MT systems, is often overestimated by the baseline BLEU method. The proposed model uses a single human reference translation, which increases the usability of the method for practical purposes. The model suggests a linguistic interpretation that relates frequency weights to human intuition about translation Adequacy and Fluency.
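To make the idea of frequency-weighted matching concrete, the following is a minimal sketch and not the authors' implementation. It assumes unigram matching only, a smoothed idf formula, and a small background corpus; the function names (`idf_weights`, `weighted_precision_recall`) and the default weight of 1.0 for unseen words are hypothetical choices made for illustration. Each matched token occurrence contributes its idf, so a word's total contribution grows with its frequency, which approximates a tf.idf weighting.

```python
import math
from collections import Counter


def idf_weights(corpus):
    """Smoothed inverse document frequency for every term in a corpus.

    `corpus` is a list of tokenised documents (lists of strings).
    The smoothing formula is an assumption of this sketch.
    """
    n_docs = len(corpus)
    doc_freq = Counter()
    for doc in corpus:
        doc_freq.update(set(doc))  # count each term once per document
    return {
        term: math.log((1 + n_docs) / (1 + df)) + 1.0
        for term, df in doc_freq.items()
    }


def weighted_precision_recall(candidate, reference, weights, default=1.0):
    """Weighted unigram precision and recall against one reference.

    Matches are clipped to the reference count, as in BLEU, and each
    matched occurrence is worth its weight instead of 1.
    """
    cand, ref = Counter(candidate), Counter(reference)

    def weight(term):
        return weights.get(term, default)

    matched = sum(min(n, ref[t]) * weight(t) for t, n in cand.items())
    cand_total = sum(n * weight(t) for t, n in cand.items())
    ref_total = sum(n * weight(t) for t, n in ref.items())
    precision = matched / cand_total if cand_total else 0.0
    recall = matched / ref_total if ref_total else 0.0
    return precision, recall


# Toy data, invented for this example only.
reference = "the cat sat on the mat".split()
background = [reference, "the dog ran".split(), "the bird flew".split()]
weights = idf_weights(background)

drops_rare = "the sat on the mat".split()    # loses the content word "cat"
drops_common = "cat sat on the mat".split()  # loses one "the"
print(weighted_precision_recall(drops_rare, reference, weights))
print(weighted_precision_recall(drops_common, reference, weights))
```

Under these assumptions, omitting a rare content word lowers weighted recall more than omitting a frequent function word, which is the kind of sensitivity to content the abstract associates with Adequacy.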

Bogdan Babych, Tony Hartley

ACL 2004 | ACL 2007 | Baseline BLEU Method | Machine Translation Evaluation | Translation Adequacy

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2004
Where	ACL
Authors	Bogdan Babych, Tony Hartley
