Sciweavers

EMNLP
2008

Arabic Named Entity Recognition using Optimized Feature Sets

14 years 1 months ago
Arabic Named Entity Recognition using Optimized Feature Sets
The Named Entity Recognition (NER) task has been garnering significant attention in NLP as it helps improve the performance of many natural language processing applications. In this paper, we investigate the impact of using different sets of features in two discriminative machine learning frameworks, namely, Support Vector Machines and Conditional Random Fields using Arabic data. We explore lexical, contextual and morphological features on eight standardized data-sets of different genres. We measure the impact of the different features in isolation, rank them according to their impact for each named entity class and incrementally combine them in order to infer the optimal machine learning approach and feature set. Our system yields a performance of F=1-measure=83.5 on ACE 2003 Broadcast News data.
Yassine Benajiba, Mona T. Diab, Paolo Rosso
Added 29 Oct 2010
Updated 29 Oct 2010
Type Conference
Year 2008
Where EMNLP
Authors Yassine Benajiba, Mona T. Diab, Paolo Rosso
Comments (0)