Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

175

AMTA
2004
Springer

218views Information Technology» more AMTA 2004»

A Structurally Diverse Minimal Corpus for Eliciting Structural Mappings Between Languages

15 years 12 months ago

A Structurally Diverse Minimal Corpus for Eliciting Structural Mappings Between Languages

Download www.cs.cmu.edu

Abstract. We describe an approach to creating a small but diverse corpus in English that can be used to elicit information about any target language. The focus of the corpus is on structural information. The resulting bilingual corpus can then be used for natural language processing tasks such as inferring transfer mappings for Machine Translation. The corpus is suﬃciently small that a bilingual user can translate and wordalign it within a matter of hours. We describe how the corpus is created and how its structural diversity is ensured. We then argue that it is not necessary to introduce a large amount of redundancy into the corpus. This is shown by creating an increasingly redundant corpus and observing that the information gained converges as redundancy increases.1

Katharina Probst, Alon Lavie

Real-time Traffic

AMTA 2004 | Bilingual Corpus | Diverse Corpus | Information Management | Redundant Corpus |

claim paper

Related Content

» Mapping between Dependency Structures and Compositional Semantic Representations

» The Cambridge CookieTheft Corpus A Corpus of Directed and Spontaneous Speech of BrainDamag...

» DL Meet FL A Bidirectional Mapping between Ontologies and Linguistic Knowledge

» Syntactic Dependencies for Multilingual and Multilevel Corpus Annotation

» Toward understanding natural language directions

» Brain Morphometry by Distance Measurement in a NonEuclidean Curvilinear Space

» Ontological Smoothing for Relation Extraction with Minimal Supervision

» Interface Terminologies Bridging the Gap between Theory and Reality for Africa

» Automatic classification of question turns in spontaneous speech using lexical and prosodi...

Post Info
More Details (n/a)

Added	30 Jun 2010
Updated	30 Jun 2010
Type	Conference
Year	2004
Where	AMTA
Authors	Katharina Probst, Alon Lavie

Comments (0)