Efficient Seeding Techniques for Protein Similarity Search

15 years 8 months ago

Download hal.archives-ouvertes.fr

Abstract. We apply the concept of subset seeds proposed in [1] to similarity search in protein sequences. The main question studied is the design of efficient seed alphabets to construct seeds with optimal sensitivity/selectivity trade-offs. We propose several different design methods and use them to construct several alphabets. We then perform an analysis of seeds built over those alphabet and compare them with the standard Blastp seeding method [2,3], as well as with the family of vector seeds proposed in [4]. While the formalism of subset seed is less expressive (but less costly to implement) than the accumulative principle used in Blastp and vector seeds, our seeds show a similar or even better performance than Blastp on Bernoulli models of proteins compatible with the common BLOSUM62 matrix.

Mikhail A. Roytberg, Anna Gambin, Laurent No&eacut

Real-time Traffic

Bioinformatics | BIRD 2008 | Efficient Seed Alphabets | Subset Seed | Vector Seeds |

claim paper

» Designing seeds for similarity search in genomic DNA

» Optimal neighborhood indexing for protein similarity search

» Designing multiple simultaneous seeds for DNA similarity search

» Efficient Video Similarity Measurement and Search

» PSI indexing protein structures for fast similarity search

» Towards Indexbased Similarity Search for Protein Structure Databases

Post Info
More Details (n/a)

Added	12 Oct 2010
Updated	12 Oct 2010
Type	Conference
Year	2008
Where	BIRD
Authors	Mikhail A. Roytberg, Anna Gambin, Laurent Noé, Slawomir Lasota, Eugenia Furletova, Ewa Szczurek, Gregory Kucherov

Comments (0)

Sciweavers

Efficient Seeding Techniques for Protein Similarity Search

Bioinformatics | BIRD 2008 | Efficient Seed Alphabets | Subset Seed | Vector Seeds |

Explore & Download

Productivity Tools

Sciweavers