We describe a low-complexity scheme for lossless compression of short text messages. The method uses arithmetic coding and a specific statistical context model for prediction of single symbols. Our particular contribution is a simple yet effective approach for storing highly complex statistics in a succinct yet effective data model that can easily be trained by text data. The proposed model already gives good compression rates with a RAM memory size of 128 kByte, thus making lossless data compression with statistical context modeling readily applicable to small devices like wireless sensors or mobile phones.
Stephan Rein, Clemens Gühmann, Frank H. P. Fi