Decomposability of Translation Metrics for Improved Evaluation and Efficient Algorithms

BLEU is the de facto standard for evaluation and development of statistical machine translation systems. We describe three real-world situations involving comparisons between different versions of the same systems where one can obtain improvements in BLEU scores that are questionable or even absurd. These situations arise because BLEU lacks the property of decomposability, a property which is also computationally convenient for various applications. We propose a very conservative modification to BLEU and a cross between BLEU and word error rate that address these issues while improving correlation with human judgments.
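To make the decomposability issue concrete, here is a minimal sketch. It is not taken from the paper and is not one of its three situations. It assumes a simplified BLEU (single reference, n-grams up to bigrams instead of the usual 4-grams, no smoothing) and an informal reading of decomposability: if one sentence's own score gets worse, the test-set score should not get better. The function names and the toy sentences are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter


def ngram_counts(tokens, n):
    """Count the n-grams of order n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu_stats(hyp, ref, max_n=2):
    """Clipped n-gram matches, n-gram totals, and lengths for one sentence pair."""
    matches, totals = [], []
    for n in range(1, max_n + 1):
        h, r = ngram_counts(hyp, n), ngram_counts(ref, n)
        matches.append(sum(min(c, r[g]) for g, c in h.items()))
        totals.append(sum(h.values()))
    return matches, totals, len(hyp), len(ref)


def bleu_from_stats(matches, totals, hyp_len, ref_len):
    """Brevity penalty times the geometric mean of the n-gram precisions."""
    if hyp_len == 0 or min(matches) == 0:
        return 0.0
    log_prec = sum(math.log(m / t) for m, t in zip(matches, totals)) / len(matches)
    brevity_penalty = min(1.0, math.exp(1 - ref_len / hyp_len))
    return brevity_penalty * math.exp(log_prec)


def sentence_bleu(hyp, ref, max_n=2):
    """Simplified BLEU computed on a single sentence pair."""
    return bleu_from_stats(*bleu_stats(hyp, ref, max_n))


def corpus_bleu(pairs, max_n=2):
    """Simplified BLEU computed from statistics pooled over all sentence pairs."""
    agg_m, agg_t, hyp_len, ref_len = [0] * max_n, [0] * max_n, 0, 0
    for hyp, ref in pairs:
        m, t, h, r = bleu_stats(hyp, ref, max_n)
        agg_m = [a + b for a, b in zip(agg_m, m)]
        agg_t = [a + b for a, b in zip(agg_t, t)]
        hyp_len, ref_len = hyp_len + h, ref_len + r
    return bleu_from_stats(agg_m, agg_t, hyp_len, ref_len)


# Toy test set (hypothetical data): sentence 1 is a perfect match,
# sentence 2 is correct but far too short.
ref1 = "the cat sat on the mat".split()
ref2 = "a dog barked loudly at night".split()
hyp1_before = "the cat sat on the mat".split()
hyp1_after = "the cat sat on the mat the mat".split()  # padded with junk words
hyp2 = "a dog".split()

before = corpus_bleu([(hyp1_before, ref1), (hyp2, ref2)])
after = corpus_bleu([(hyp1_after, ref1), (hyp2, ref2)])

print("sentence 1 alone:", sentence_bleu(hyp1_before, ref1), "->", sentence_bleu(hyp1_after, ref1))
print("whole test set:  ", before, "->", after)
```

Under these assumptions, padding sentence 1 lowers that sentence's own score, yet the pooled score rises because the extra words reduce the test-set brevity penalty incurred by sentence 2. This is the kind of behavior a decomposable metric would rule out. The sketch also shows that the pooled score is not the average of the per-sentence scores.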

David Chiang, Steve DeNeefe, Yee Seng Chan, Hwee Tou Ng

BLEU | BLEU Scores | EMNLP 2008 | Natural Language Processing | Situations Involving Comparisons

» Automatic Evaluation Measures for Statistical Machine Translation System Optimization

» Decomposing object-oriented class modules using an agglomerative clustering technique

» Improved Word-Level System Combination for Machine Translation

» Discriminative Reranking for Machine Translation

» Learning to Improve both Efficiency and Quality of Planning

» Evaluating automatic parallelization for efficient execution on shared-memory multiprocesso...

» XML-Based RDF Data Management for Efficient Query Processing

» Unsupervised Search for the Optimal Segmentation for Statistical Machine Translation

Post Info

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	EMNLP
Authors	David Chiang, Steve DeNeefe, Yee Seng Chan, Hwee Tou Ng
