Sciweavers

RECOMB
2000
Springer

A compression algorithm for DNA sequences and its applications in genome comparison

14 years 3 months ago
A compression algorithm for DNA sequences and its applications in genome comparison
We present a lossless compression algorithm, GenCompress, for genetic sequences, based on searching for approximate repeats. Our algorithm achieves the best compression ratios for benchmark DNA sequences. Signi cantly better compression results show that the approximate repeats are one of the main hidden regularities in DNA sequences. We then describe a theory of measuring the relatedness between two DNA sequences. Using our algorithm, we present strong experimental support for this theory, and demonstrate its application in comparing genomes and constructing evolutionary trees.
Xin Chen, Sam Kwong, Ming Li
Added 25 Aug 2010
Updated 25 Aug 2010
Type Conference
Year 2000
Where RECOMB
Authors Xin Chen, Sam Kwong, Ming Li
Comments (0)