CIARP
2006
Springer

Alignment of Paragraphs in Bilingual Texts Using Bilingual Dictionaries and Dynamic Programming

Parallel text alignment is a special type of pattern recognition task aimed at discovering the similarity between two sequences of symbols. Given the same text in two different languages, the task is to decide which elements (paragraphs, in the case of paragraph alignment) in one text are translations of which elements of the other text. One of the applications is training statistical machine translation algorithms. The task is not trivial unless detailed text understanding can be afforded. In our previous work we presented a simple technique that relies on bilingual dictionaries but does not perform any syntactic analysis of the texts. In this paper we give a formal definition of the task and present an exact optimization algorithm for finding the best alignment.
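
The abstract does not spell out the algorithm, so the following is only a minimal sketch of how dictionary-based paragraph alignment with dynamic programming might look, not the authors' method. It assumes paragraphs are already tokenized into word lists, that the bilingual dictionary is a plain mapping from a source word to a set of target words, and that only one-to-one matches and skips are allowed. The names `similarity`, `align`, and `skip_penalty` are hypothetical.

```python
from typing import Dict, List, Set, Tuple

Paragraph = List[str]


def similarity(src: Paragraph, tgt: Paragraph,
               dictionary: Dict[str, Set[str]]) -> float:
    """Share of source words that have a dictionary translation in the
    target paragraph, normalized by the longer paragraph's length."""
    if not src or not tgt:
        return 0.0
    tgt_words = set(tgt)
    hits = sum(1 for w in src if dictionary.get(w, set()) & tgt_words)
    return hits / max(len(src), len(tgt))


def align(src_pars: List[Paragraph], tgt_pars: List[Paragraph],
          dictionary: Dict[str, Set[str]],
          skip_penalty: float = 0.1) -> List[Tuple[int, int]]:
    """Return index pairs (i, j) of matched paragraphs that maximize the
    total similarity minus a penalty for each skipped paragraph.
    Assumption: alignments are monotone and one-to-one."""
    n, m = len(src_pars), len(tgt_pars)
    # best[i][j]: best score for the first i source and first j target paragraphs
    best = [[0.0] * (m + 1) for _ in range(n + 1)]
    back = [[""] * (m + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        for j in range(m + 1):
            if i == 0 and j == 0:
                continue
            candidates = []
            if i > 0 and j > 0:
                score = similarity(src_pars[i - 1], tgt_pars[j - 1], dictionary)
                candidates.append((best[i - 1][j - 1] + score, "match"))
            if i > 0:
                candidates.append((best[i - 1][j] - skip_penalty, "skip_src"))
            if j > 0:
                candidates.append((best[i][j - 1] - skip_penalty, "skip_tgt"))
            best[i][j], back[i][j] = max(candidates, key=lambda c: c[0])
    # Walk the back-pointers from the end to recover the matched pairs.
    pairs = []
    i, j = n, m
    while i > 0 or j > 0:
        move = back[i][j]
        if move == "match":
            pairs.append((i - 1, j - 1))
            i, j = i - 1, j - 1
        elif move == "skip_src":
            i -= 1
        else:
            j -= 1
    pairs.reverse()
    return pairs
```

Because the table is filled exhaustively, the result is the exact optimum for this simplified scoring function. A realistic aligner would also need to handle one-to-many paragraph matches and word-form normalization, which this sketch leaves out.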

Alexander F. Gelbukh, Grigori Sidorov

CIARP 2006 | Detailed Text Understanding | Parallel Text Alignment | Pattern Recognition | Pattern Recognition Task |

Related Content

» Bilingual Text Matching using Bilingual Dictionary and Statistics

» Bilingual Knowledge Acquisition from Korean-English Parallel Corpus Using Alignment

» High-Performance Bilingual Text Alignment Using Statistical and Dictionary Information

» Creating a Reusable English-Chinese Parallel Corpus for Bilingual Dictionary Construction

» Learning Translation Templates From Bilingual Text

» Automatic extraction of translations from web-based bilingual materials

» Segmentation and alignment of parallel text for statistical machine translation

» Combining Sentence Length with Location Information to Align Monolingual Parallel Texts

» Unsupervised Multilingual Grammar Induction

Added	20 Aug 2010
Updated	20 Aug 2010
Type	Conference
Year	2006
Where	CIARP
Authors	Alexander F. Gelbukh, Grigori Sidorov