Automatic Discovery of Named Entity Variants: Grammar-driven Approaches to Non-Alphabetical Transliterations

15 years 9 months ago

Download cwn.ling.sinica.edu.tw

Identiﬁcation of transliterated names is a particularly difﬁcult task of Named Entity Recognition (NER), especially in the Chinese context. Of all possible variations of transliterated named entities, the difference between PRC and Taiwan is the most prevalent and most challenging. In this paper, we introduce a novel approach to the automatic extraction of diverging transliterations of foreign named entities by bootstrapping cooccurrence statistics from tagged and segmented Chinese corpus. Preliminary experiment yields promising results and shows its potential in NLP applications.

Chu-Ren Huang, Petr Simon, Shu-Kai Hsieh

Real-time Traffic

ACL 2007 | Computational Linguistics | Foreign Named Entities | Named Entity Recognition | Transliterated Named Entities |

claim paper

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2007
Where	ACL
Authors	Chu-Ren Huang, Petr Simon, Shu-Kai Hsieh

Sciweavers

Automatic Discovery of Named Entity Variants: Grammar-driven Approaches to Non-Alphabetical Transliterations

ACL 2007 | Computational Linguistics | Foreign Named Entities | Named Entity Recognition | Transliterated Named Entities |

Explore & Download

Productivity Tools

Sciweavers