Urdu and Hindi: Translation and sharing of linguistic resources

15 years 1 months ago

Download www.aclweb.org

Hindi and Urdu share a common phonology, morphology and grammar but are written in different scripts. In addition, the vocabularies have also diverged significantly especially in the written form. In this paper we show that we can get reasonable quality translations (we estimated the Translation Error rate at 18%) between the two languages even in absence of a parallel corpus. Linguistic resources such as treebanks, part of speech tagged data and parallel corpora with English are limited for both these languages. We use the translation system to share linguistic resources between the two languages. We demonstrate improvements on three tasks and show: statistical machine translation from Urdu to English is improved (0.8 in BLEU score) by using a Hindi-English parallel corpus, Hindi part of speech tagging is improved (upto 6% absolute) by using an Urdu part of speech corpus and a Hindi-English word aligner is improved by using a manually word aligned UrduEnglish corpus (upto 9% absolute...

Karthik Visweswariah, Vijil Chenthamarakshan, Nand

Real-time Traffic

COLING 2010 | Computational Linguistics | Hindi-English Parallel Corpus | Linguistic Resources | Parallel Corpus |

claim paper

» Finitestate Scriptural Translation

» LCSTAR II Starring more Lexica

» Clustering of Terms from Translation Dictionaries and Synonyms Lists to Automatically Buil...

» Linguistic Resources for Reconstructing Spontaneous Speech Text

» CrossLanguage Frame Semantics Transfer in Bilingual Corpora

Post Info
More Details (n/a)

Added	13 May 2011
Updated	13 May 2011
Type	Journal
Year	2010
Where	COLING
Authors	Karthik Visweswariah, Vijil Chenthamarakshan, Nandakishore Kambhatla

Comments (0)

Sciweavers

Urdu and Hindi: Translation and sharing of linguistic resources

COLING 2010 | Computational Linguistics | Hindi-English Parallel Corpus | Linguistic Resources | Parallel Corpus |

Explore & Download

Productivity Tools

Sciweavers