
LREC
2008

Identification of Naturally Occurring Numerical Expressions in Arabic

In this paper, we define the task of number identification in natural context. We present and validate a language-independent, semi-automatic approach to quickly building a gold standard for evaluating number identification systems by exploiting hand-aligned parallel data. We also present and extensively evaluate a robust rule-based system that identifies Arabic numbers in natural context across a variety of number formats and types. On a blind test, the system achieves a 94.8% F-score on correctly identifying number expression spans in natural text and a 92.1% F-score on correctly determining the core numerical value.
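The abstract reports span-level F-scores but does not spell out the scoring protocol. As a minimal sketch, assuming exact-match scoring over token spans (an assumption, not necessarily the authors' method), precision, recall, and F-score could be computed as below. The function name `span_f_score` and the example spans are hypothetical and only illustrate the arithmetic.

```python
from typing import Iterable, Set, Tuple

# A span is (start_token, end_token) with an exclusive end index.
# This representation is an assumption made for illustration.
Span = Tuple[int, int]


def span_f_score(
    gold: Iterable[Span], predicted: Iterable[Span]
) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) under exact span matching."""
    gold_set: Set[Span] = set(gold)
    pred_set: Set[Span] = set(predicted)

    # A predicted span counts as correct only if both boundaries match.
    true_positives = len(gold_set & pred_set)

    precision = true_positives / len(pred_set) if pred_set else 0.0
    recall = true_positives / len(gold_set) if gold_set else 0.0

    if precision + recall == 0:
        return precision, recall, 0.0

    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical data: the second predicted span has a wrong right boundary.
gold_spans = [(0, 2), (5, 6), (9, 12)]
predicted_spans = [(0, 2), (5, 7), (9, 12)]

precision, recall, f1 = span_f_score(gold_spans, predicted_spans)
print(f"P={precision:.3f} R={recall:.3f} F={f1:.3f}")
```

Exact matching is the strictest choice: a span that overlaps the gold span but misses one boundary counts as both a false positive and a false negative. A value-level score like the second figure in the abstract could be obtained the same way by comparing (span, value) pairs instead of spans alone.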
Nizar Habash, Ryan Roth
Added: 29 Oct 2010
Updated: 29 Oct 2010
Type: Conference
Year: 2008
Where: LREC
Authors: Nizar Habash, Ryan Roth