Applying Automated Metrics to Speech Translation Dialogs

15 years 8 months ago

Download www.lrec-conf.org

Over the past five years, the Defense Advanced Research Projects Agency (DARPA) has funded development of speech translation systems for tactical applications. A key component of the research program has been extensive system evaluation, with dual objectives of assessing progress overall and comparing among systems. This paper describes the methods used to obtain BLEU, TER, and METEOR scores for two-way English-Iraqi Arabic systems. We compare the scores with measures based on human judgments and demonstrate the effects of normalization operations on BLEU scores. Issues that are highlighted include the quality of test data and differential results of applying automated metrics to Arabic vs. English.

Sherri L. Condon, Jon Phillips, Christy Doran, Joh

Real-time Traffic

Defense Advanced Research Projects Agency | Education | English-Iraqi Arabic Systems | LREC 2008 | Speech Translation Systems |

claim paper

» Applying Machine Translation Evaluation Techniques to Textual CBR

» Translation Adequacy and Preference Evaluation Tool TAPET

» Dynamic noise analysis in prechargeevaluate circuits

» A performance tuning methodology with compiler support

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	LREC
Authors	Sherri L. Condon, Jon Phillips, Christy Doran, John S. Aberdeen, Dan Parvaz, Beatrice T. Oshika, Greg Sanders, Craig Schlenoff

Comments (0)

Sciweavers

Applying Automated Metrics to Speech Translation Dialogs

Defense Advanced Research Projects Agency | Education | English-Iraqi Arabic Systems | LREC 2008 | Speech Translation Systems |

Explore & Download

Productivity Tools

Sciweavers