
LREC
2010

Construction of Chunk-Aligned Bilingual Lecture Corpus for Simultaneous Machine Translation

With the development of speech and language processing, speech translation systems have been developed. These systems target spoken dialogues and employ consecutive interpretation, which uses a sentence as the translation unit. By contrast, there have been only a few studies on simultaneous interpreting, although language resources for promoting such research have recently been prepared, for example through the publication of a large-scale analytical corpus. Going forward, the corpora need to be made more practical for realizing a simultaneous interpreting system. In this paper, we describe the construction of a bilingual corpus that can be used for simultaneous lecture interpreting research. Simultaneous lecture interpreting systems are required to recognize translation units in the middle of a sentence and generate their translations at the proper timing. We constructed the bilingual lecture corpus by the following steps. First, we segmented sentences i...
Added 29 Oct 2010
Updated 29 Oct 2010
Type Conference
Year 2010
Where LREC
Authors Masaki Murata, Tomohiro Ohno, Shigeki Matsubara, Yasuyoshi Inagaki