Sciweavers

ACSC
2002
IEEE

Enhanced Word-Based Block-Sorting Text Compression

14 years 5 months ago
Enhanced Word-Based Block-Sorting Text Compression
The Block Sorting process of Burrows and Wheeler can be applied to any sequence in which symbols are (or might be) conditioned upon each other. In particular, it is possible to parse text into a stream of words, and then employ block sorting to identify and so exploit any conditioning relationships between words. In this paper we build upon the previous work of two of the authors, describing several further recency rank transformations, and considering also the role of the entropy coder. By combining the best of the new recency transformations with an entropy coder that conditions ranks upon gross characteristics of previous ones, we are able to obtain improved compression on typical text files.
R. Yugo Kartono Isal, Alistair Moffat, A. C. H. Ng
Added 14 Jul 2010
Updated 14 Jul 2010
Type Conference
Year 2002
Where ACSC
Authors R. Yugo Kartono Isal, Alistair Moffat, A. C. H. Ngai
Comments (0)