Multilayer model for Arabic text compression

15 years 1 months ago

Download www.ccis2k.org

: This article describes a multilayer model-based approach for text compression. It uses linguistic information to develop a multilayer decomposition model of the text in order to achieve better compression. This new approach is illustrated for the case of the Arabic language, where the majority of words are generated according to the Semitic root-and-pattern scheme. Text is split into three linguistically homogeneous layers representing the three categories of words: derivative, nonderivative and functional words. A fourth layer, called the Mask, is introduced to aid with the reconstruction of the original text from the three layers in the decoding side. Suitable compression techniques are then applied to the different layers in order to maximize the compression ratio. The proposed method has been evaluated in terms of the rate of compression it provides and its time efficiency. Results are shown along with real texts to illustrate the performance of the new approach. The novelties of...

Arafat Awajan

Real-time Traffic

Compression Techniques | Distributed And Parallel Computing | IAJIT 2011 | Multilayer Model-based Approach | Suitable Compression Techniques |

claim paper

Added	14 May 2011
Updated	14 May 2011
Type	Journal
Year	2011
Where	IAJIT
Authors	Arafat Awajan

Sciweavers

Multilayer model for Arabic text compression

Compression Techniques | Distributed And Parallel Computing | IAJIT 2011 | Multilayer Model-based Approach | Suitable Compression Techniques |

Explore & Download

Productivity Tools

Sciweavers