Sciweavers

ACL
2007

Automatic Discovery of Named Entity Variants: Grammar-driven Approaches to Non-Alphabetical Transliterations

14 years 1 months ago
Automatic Discovery of Named Entity Variants: Grammar-driven Approaches to Non-Alphabetical Transliterations
Identification of transliterated names is a particularly difficult task of Named Entity Recognition (NER), especially in the Chinese context. Of all possible variations of transliterated named entities, the difference between PRC and Taiwan is the most prevalent and most challenging. In this paper, we introduce a novel approach to the automatic extraction of diverging transliterations of foreign named entities by bootstrapping cooccurrence statistics from tagged and segmented Chinese corpus. Preliminary experiment yields promising results and shows its potential in NLP applications.
Chu-Ren Huang, Petr Simon, Shu-Kai Hsieh
Added 29 Oct 2010
Updated 29 Oct 2010
Type Conference
Year 2007
Where ACL
Authors Chu-Ren Huang, Petr Simon, Shu-Kai Hsieh
Comments (0)