Long Sentence Partitioning using Structure Analysis for Machine Translation

15 years 11 months ago

Download www.afnlp.org

in machine translation, long sentences are usually assumed to be difficult to treat. The main reason is the syntactic ambiguity which increases explosively as a sentence become longer. Especially, in the machine translation using sentence patterns, a long sentence causes a critical coverage problem. In this paper, we present a method of sentence partitioning which recognizes sub-sentence ranges by structure analysis, reducing the length of a sentence for translation. For the analysis of the clausal structure, phrase-level sentence patterns which have only a little syntactic ambiguities are employed. The structure analysis is conducted by the recognition of starting points of all clauses, dependency analysis, and depth analysis. Then, the ranges of sub-sentences are extracted based on the depth by stages. Our method was evaluated on 108 sentences extracted from CNN transcripts. It showed 85.2% accuracy in the detection of simple sentences.

Yoon-Hyung Roh, Young Ae Seo, Ki-Young Lee, Sung-K

Real-time Traffic

Long Sentence | Machine Translation | Natural Language Processing | NLPRS 2001 | Sentence Patterns |

claim paper

» Symmetric Pattern Matching Analysis for English Coordinate Structures

» PASBio predicateargument structures for event extraction in molecular biology

» A Criticality Analysis of Clustering in Superscalar Processors

» Conotoxin Protein Classification Using Free Scores of Words and Support Vector Machines

Post Info
More Details (n/a)

Added	30 Jul 2010
Updated	30 Jul 2010
Type	Conference
Year	2001
Where	NLPRS
Authors	Yoon-Hyung Roh, Young Ae Seo, Ki-Young Lee, Sung-Kwon Choi

Comments (0)

Sciweavers

Long Sentence Partitioning using Structure Analysis for Machine Translation

Long Sentence | Machine Translation | Natural Language Processing | NLPRS 2001 | Sentence Patterns |

Explore & Download

Productivity Tools

Sciweavers