Character-Level Machine Translation Evaluation for Languages with Ambiguous Word Boundaries

13 years 9 months ago

Download www.aclweb.org

In this work, we introduce the TESLACELAB metric (Translation Evaluation of Sentences with Linear-programming-based Analysis – Character-level Evaluation for Languages with Ambiguous word Boundaries) for automatic machine translation evaluation. For languages such as Chinese where words usually have meaningful internal structure and word boundaries are often fuzzy, TESLA-CELAB acknowledges the advantage of character-level evaluation over word-level evaluation. By reformulating the problem in the linear programming framework, TESLACELAB addresses several drawbacks of the character-level metrics, in particular the modeling of synonyms spanning multiple characters. We show empirically that TESLACELAB signiﬁcantly outperforms characterlevel BLEU in the English-Chinese translation evaluation tasks.

Chang Liu, Hwee Tou Ng

Real-time Traffic

ACL 2012 | Automatic Machine Translation | Computational Linguistics | English Chinese Translation | Level Metrics |

claim paper

» A Hybrid MorphemeWord Representation for Machine Translation of Morphologically Rich Langu...

» Combining Morphemebased Machine Translation with Postprocessing Morpheme Prediction

» Morphological Richness Offsets Resource Demand Experiences in Constructing a POS Tagger f...

Post Info
More Details (n/a)

Added	29 Sep 2012
Updated	29 Sep 2012
Type	Journal
Year	2012
Where	ACL
Authors	Chang Liu, Hwee Tou Ng

Comments (0)

Sciweavers

Character-Level Machine Translation Evaluation for Languages with Ambiguous Word Boundaries

ACL 2012 | Automatic Machine Translation | Computational Linguistics | English Chinese Translation | Level Metrics |

Explore & Download

Productivity Tools

Sciweavers