Relaxed Cross-lingual Projection of Constituent Syntax

13 years 2 months ago

Download www.nlp.org.cn

We propose a relaxed correspondence assumption for cross-lingual projection of constituent syntax, which allows a supposed constituent of the target sentence to correspond to an unrestricted treelet in the source parse. Such a relaxed assumption fundamentally tolerates the syntactic non-isomorphism between languages, and enables us to learn the target-language-speciﬁc syntactic idiosyncrasy rather than a strained grammar directly projected from the source language syntax. Based on this assumption, a novel constituency projection method is also proposed in order to induce a projected constituent treebank from the source-parsed bilingual corpus. Experiments show that, the parser trained on the projected treebank dramatically outperforms previous projected and unsupervised parsers.

Wenbin Jiang, Qun Liu, Yajuan Lv

Real-time Traffic

EMNLP 2011 | Language Syntax | Natural Language Processing | Projection Method | Target Language |

claim paper

Post Info
More Details (n/a)

Added	20 Dec 2011
Updated	20 Dec 2011
Type	Journal
Year	2011
Where	EMNLP
Authors	Wenbin Jiang, Qun Liu, Yajuan Lv

Comments (0)

Sciweavers

Relaxed Cross-lingual Projection of Constituent Syntax

EMNLP 2011 | Language Syntax | Natural Language Processing | Projection Method | Target Language |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers