Optimal neighborhood indexing for protein similarity search

15 years 6 months ago

Download www.biomedcentral.com

Background: Similarity inference, one of the main bioinformatics tasks, has to face an exponential growth of the biological data. A classical approach used to cope with this data flow involves heuristics with large seed indexes. In order to speed up this technique, the index can be enhanced by storing additional information to limit the number of random memory accesses. However, this improvement leads to a larger index that may become a bottleneck. In the case of protein similarity search, we propose to decrease the index size by reducing the amino acid alphabet. Results: The paper presents two main contributions. First, we show that an optimal neighborhood indexing combining an alphabet reduction and a longer neighborhood leads to a reduction of 35% of memory involved into the process, without sacrificing the quality of results nor the computational time. Second, our approach led us to develop a new kind of substitution score matrices and their associated e-value parameters. In contr...

Pierre Peterlongo, Laurent Noé, Dominique L

Real-time Traffic

Amino Acid | BMCBI 2008 | Proteins | Substitution Score Matrices |

claim paper

» PSI indexing protein structures for fast similarity search

» On Optimizing DistanceBased Similarity Search for Biological Databases

» Towards Indexbased Similarity Search for Protein Structure Databases

» A fast indexing approach for protein structure comparison

» DDPIn Distance and density based protein indexing

» Effective Indexing and Filtering for Similarity Search in Large Biosequence Databases

» Neighborhood based fast graph search in large networks

» Optimal Combination of SOM Search in BestMatching Units and Map Neighborhood

Post Info
More Details (n/a)

Added	09 Dec 2010
Updated	09 Dec 2010
Type	Journal
Year	2008
Where	BMCBI
Authors	Pierre Peterlongo, Laurent Noé, Dominique Lavenier, Van Hoa Nguyen, Gregory Kucherov, Mathieu Giraud

Comments (0)

Sciweavers

Optimal neighborhood indexing for protein similarity search

Amino Acid | BMCBI 2008 | Proteins | Substitution Score Matrices |

Explore & Download

Productivity Tools

Sciweavers