The Smith-Waterman algorithm for local sequence alignment is one of the most important techniques in computational molecular biology. This ingenious dynamic programming approach was designed to reveal the highly conserved fragments by discarding poorly conserved initial and terminal segments. However, the existing notion of local similarity has a serious aw: it does not discard poorly conserved intermediate segments. The Smith-Waterman algorithm nds the local alignment with maximal score but it is unable to nd local alignment with maximum degree of similarity (e.g., maximal percent of matches). Moreover, there is still no e cient algorithm that answers the following natural question: do two sequences share a (su ciently long) fragment with more than 70% of similarity? As a result, the local alignment sometimes produces a mosaic of well-conserved fragments arti cially connected by poorly-conserved or even unrelated fragments. This may lead to problems in comparison of long genomic sequ...
Abdullah N. Arslan, Ömer Egecioglu, Pavel A.