We describe an approach to simultaneous tokenization and part-of-speech tagging that separates closed-class from open-class items and focuses on the likelihood of the possible stems of open-class words. By encoding some basic linguistic information, we simplify the machine learning task while achieving state-of-the-art tokenization results and competitive POS results, although with a reduced tag set and some evaluation difficulties.