DCC
2003
IEEE

A Dictionary-Based Multi-Corpora Text Compression System

In this paper we introduce StarZip, a multi-corpora lossless text compression utility that incorporates StarNT, our newly proposed transform algorithm. StarNT is a fast, dictionary-based lossless text transform algorithm that uses a ternary search tree to expedite transform encoding. For large files, viz. 400 Kbytes or more, our experiments show that compression is no slower than with bzip2 and gzip, and much faster than with PPMD. However, if the file size is small, our algorithm is 28.1% and
Weifeng Sun, Nan Zhang 0005, Amar Mukherjee
Added 25 Dec 2009
Updated 25 Dec 2009
Type Conference
Year 2003
Where DCC
Authors Weifeng Sun, Nan Zhang 0005, Amar Mukherjee
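
The abstract mentions that StarNT uses a ternary search tree to speed up transform encoding. The sketch below illustrates the general idea only: a ternary search tree that maps dictionary words to short codewords, plus a word-by-word substitution pass. The class and function names (`TernarySearchTree`, `transform`) and the single-letter codewords are hypothetical and chosen for illustration; the actual StarNT dictionary, codeword scheme, and handling of case and punctuation are not described in this listing.

```python
from typing import Optional


class _Node:
    """One character of a stored word, with three child links."""
    __slots__ = ("ch", "lo", "eq", "hi", "code")

    def __init__(self, ch: str) -> None:
        self.ch = ch
        self.lo: Optional["_Node"] = None   # characters smaller than ch
        self.eq: Optional["_Node"] = None   # next character of the word
        self.hi: Optional["_Node"] = None   # characters larger than ch
        self.code: Optional[str] = None     # codeword if a word ends here


class TernarySearchTree:
    """Maps dictionary words to codewords (illustrative, not StarNT itself)."""

    def __init__(self) -> None:
        self.root: Optional[_Node] = None

    def insert(self, word: str, code: str) -> None:
        if not word:
            raise ValueError("cannot insert an empty word")
        self.root = self._insert(self.root, word, 0, code)

    def _insert(self, node: Optional[_Node], word: str, i: int, code: str) -> _Node:
        ch = word[i]
        if node is None:
            node = _Node(ch)
        if ch < node.ch:
            node.lo = self._insert(node.lo, word, i, code)
        elif ch > node.ch:
            node.hi = self._insert(node.hi, word, i, code)
        elif i + 1 < len(word):
            node.eq = self._insert(node.eq, word, i + 1, code)
        else:
            node.code = code
        return node

    def lookup(self, word: str) -> Optional[str]:
        """Return the codeword for `word`, or None if it is not stored."""
        node, i = self.root, 0
        while node is not None and i < len(word):
            ch = word[i]
            if ch < node.ch:
                node = node.lo
            elif ch > node.ch:
                node = node.hi
            else:
                i += 1
                if i == len(word):
                    return node.code
                node = node.eq
        return None


def transform(text: str, tree: TernarySearchTree) -> str:
    """Replace each space-separated word found in the tree by its codeword.

    Words not in the dictionary are passed through unchanged (an assumption
    made for this sketch).
    """
    return " ".join(tree.lookup(w) or w for w in text.split(" "))


# Hypothetical three-word dictionary with made-up codewords.
tree = TernarySearchTree()
for word, code in [("the", "a"), ("of", "b"), ("there", "c")]:
    tree.insert(word, code)

encoded = transform("the end of there", tree)
```

Each lookup touches only the characters of the query word plus a few sibling comparisons, which is why this structure is a common choice for fast dictionary matching; a lossless transform would additionally need an inverse mapping and an escape mechanism for words that collide with codewords, both omitted here.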