Sciweavers

WIA
2005
Springer

Compressing XML Documents Using Recursive Finite State Automata

14 years 5 months ago
Compressing XML Documents Using Recursive Finite State Automata
Abstract. We propose a scheme for automatically generating compressors for XML documents from Document Type Definition(DTD) specifications. Our algorithm is a lossless adaptive algorithm where the model used for compression and decompression is generated automatically from the DTD, and is used in conjunction with an arithmetic compressor to produce a compressed version of the document. The structure of the model mirrors the syntactic specification of the document. Our compression scheme is on-line, that is, it can compress the document as it is being read. We have implemented the compressor generator, and provide the results of experiments on some large XML databases whose DTD’s are specified. We note that the average compression is better than that of XMLPPM, the only other on-line tool we are aware of. The tool is able to compress massive documents where XMLPPM failed to work as it ran out of memory. We believe the main appeal of this technique is the fact that the underlying m...
Hariharan Subramanian, Priti Shankar
Added 28 Jun 2010
Updated 28 Jun 2010
Type Conference
Year 2005
Where WIA
Authors Hariharan Subramanian, Priti Shankar
Comments (0)