Sciweavers

DMIN
2006

Arabic Text Classification Using N-Gram Frequency Statistics A Comparative Study

14 years 29 days ago
Arabic Text Classification Using N-Gram Frequency Statistics A Comparative Study
This paper presents the results of classifying Arabic text documents using the N-gram frequency statistics technique employing a dissimilarity measure called the "Manhattan distance", and Dice's measure of similarity. The Dice measure was used for comparison purposes. Results show that N-gram text classification using the Dice measure outperforms classification using the Manhattan measure.
Laila Khreisat
Added 30 Oct 2010
Updated 30 Oct 2010
Type Conference
Year 2006
Where DMIN
Authors Laila Khreisat
Comments (0)