ACL
1998

Splitting Long or Ill-formed Input for Robust Spoken-language Translation

This paper proposes an input-splitting method for translating spoken language, which includes many long or ill-formed expressions. The method splits the input into well-balanced translation units based on a semantic distance calculation. Splitting is performed during left-to-right parsing and does not degrade translation efficiency. The complete translation is formed by concatenating the partial translations of the split units. The method can be incorporated into frameworks such as TDMT that use left-to-right parsing and assign a score to each substructure. Experimental results show that the method gives TDMT the following advantages: (1) elimination of null outputs, (2) splitting of utterances into sentences, and (3) robust translation of erroneous speech recognition results.
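The split-then-concatenate idea in the abstract can be illustrated with a small sketch. This is not the paper's algorithm: it swaps in a plain dynamic program over token positions instead of splitting during left-to-right parsing, and the names `split_input`, `translate`, `unit_cost`, `translate_unit`, `max_len`, and `unknown_penalty` are all hypothetical. `unit_cost` stands in for a substructure score such as a semantic distance (lower is better), and returns `None` when a span cannot be handled as one unit. Single tokens that cannot be scored are passed through with a penalty, so some output is always produced.

```python
from typing import Callable, List, Optional, Sequence, Tuple

Span = Tuple[int, int]


def split_input(
    tokens: Sequence[str],
    unit_cost: Callable[[Sequence[str]], Optional[float]],
    max_len: int = 8,
    unknown_penalty: float = 10.0,
) -> List[Span]:
    """Choose the segmentation with the lowest total unit cost (hypothetical scoring)."""
    n = len(tokens)
    best = [float("inf")] * (n + 1)  # best[i]: lowest cost of covering tokens[:i]
    back = [0] * (n + 1)             # back[i]: start index of the last unit ending at i
    best[0] = 0.0
    for end in range(1, n + 1):
        for start in range(max(0, end - max_len), end):
            cost = unit_cost(tokens[start:end])
            if cost is None:
                if end - start > 1:
                    continue            # multi-token span is not a usable unit
                cost = unknown_penalty  # single token: keep it, but penalize
            if best[start] + cost < best[end]:
                best[end] = best[start] + cost
                back[end] = start
    # Walk the back-pointers from the end to recover the chosen spans.
    spans: List[Span] = []
    end = n
    while end > 0:
        spans.append((back[end], end))
        end = back[end]
    return spans[::-1]


def translate(
    tokens: Sequence[str],
    unit_cost: Callable[[Sequence[str]], Optional[float]],
    translate_unit: Callable[[Sequence[str]], str],
) -> str:
    """Translate each split unit separately and concatenate the partial results."""
    spans = split_input(tokens, unit_cost)
    return " ".join(translate_unit(tokens[s:e]) for s, e in spans)


# Toy stand-ins for a real scorer and translator (illustrative data only).
TOY_UNITS = {
    ("i", "want", "a", "room"): (1.0, "[ROOM-REQUEST]"),
    ("how", "much", "is", "it"): (1.0, "[PRICE-QUESTION]"),
}


def toy_cost(unit: Sequence[str]) -> Optional[float]:
    entry = TOY_UNITS.get(tuple(unit))
    return entry[0] if entry else None


def toy_translate(unit: Sequence[str]) -> str:
    entry = TOY_UNITS.get(tuple(unit))
    return entry[1] if entry else " ".join(unit)  # pass unknown tokens through


utterance = "i want a room uh how much is it".split()
print(split_input(utterance, toy_cost))
print(translate(utterance, toy_cost, toy_translate))
```

In this toy run the filler word "uh" matches no known unit, so it becomes a pass-through unit of its own while the two recognizable stretches on either side are translated separately.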
Osamu Furuse, Setsuo Yamada, Kazuhide Yamamoto
Type Conference
Year 1998
Where ACL