ACL
2007

Bootstrapping a Stochastic Transducer for Arabic-English Transliteration Extraction

We propose a bootstrapping approach to training a memoryless stochastic transducer for the task of extracting transliterations from an English-Arabic bitext. The transducer learns its similarity metric from the data in the bitext, and thus can function directly on strings written in different writing scripts without any additional language knowledge. We show that this bootstrapped transducer performs as well as or better than a model designed specifically to detect Arabic-English transliterations.
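As a rough illustration of the kind of similarity score a memoryless stochastic transducer assigns to a string pair, the sketch below sums the probability of every sequence of edit operations (substitution, deletion, insertion) that produces the pair, then multiplies by a stopping probability. It is not the authors' implementation: the function name, the toy probability tables, and the use of Latin letters as stand-ins for the two scripts are all assumptions made for the example, and the bootstrapped training step that would estimate the probabilities from a bitext is omitted.

```python
# Hypothetical toy parameters (not taken from the paper). In a memoryless
# transducer, every edit operation has a fixed probability that does not
# depend on earlier operations; all of them plus STOP sum to 1.
SUB = {("a", "x"): 0.3, ("b", "y"): 0.3}   # emit a source/target symbol pair
DEL = {"a": 0.05, "b": 0.05}               # emit a source symbol alone
INS = {"x": 0.05, "y": 0.05}               # emit a target symbol alone
STOP = 0.2                                 # terminate generation


def forward_probability(source, target, sub=SUB, dele=DEL, ins=INS, stop=STOP):
    """Total probability of generating (source, target) over all edit paths."""
    n, m = len(source), len(target)
    # alpha[i][j]: probability of having generated source[:i] and target[:j]
    alpha = [[0.0] * (m + 1) for _ in range(n + 1)]
    alpha[0][0] = 1.0
    for i in range(n + 1):
        for j in range(m + 1):
            if i > 0:  # last operation deleted source[i-1]
                alpha[i][j] += dele.get(source[i - 1], 0.0) * alpha[i - 1][j]
            if j > 0:  # last operation inserted target[j-1]
                alpha[i][j] += ins.get(target[j - 1], 0.0) * alpha[i][j - 1]
            if i > 0 and j > 0:  # last operation substituted the symbol pair
                pair = (source[i - 1], target[j - 1])
                alpha[i][j] += sub.get(pair, 0.0) * alpha[i - 1][j - 1]
    return alpha[n][m] * stop


if __name__ == "__main__":
    # A pair whose symbols line up under SUB should outscore a scrambled one.
    print(forward_probability("ab", "xy"))
    print(forward_probability("ab", "yx"))
```

Because the score depends only on the learned operation probabilities, nothing in the computation requires the two strings to share an alphabet, which is the property the abstract relies on for comparing Arabic and English strings directly.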
Tarek Sherif, Grzegorz Kondrak
Added 29 Oct 2010
Updated 29 Oct 2010
Type Conference
Year 2007
Where ACL
Authors Tarek Sherif, Grzegorz Kondrak