Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

99

BMCBI
2008

favoriteEmaildiscussreport

164views more BMCBI 2008»

Word correlation matrices for protein sequence analysis and remote homology detection

15 years 9 days ago

Word correlation matrices for protein sequence analysis and remote homology detection

Download www.biomedcentral.com

Background: Classification of protein sequences is a central problem in computational biology. Currently, among computational methods discriminative kernel-based approaches provide the most accurate results. However, kernel-based methods often lack an interpretable model for analysis of discriminative sequence features, and predictions on new sequences usually are computationally expensive. Results: In this work we present a novel kernel for protein sequences based on average word similarity between two sequences. We show that this kernel gives rise to a feature space that allows analysis of discriminative features and fast classification of new sequences. We demonstrate the performance of our approach on a widely-used benchmark setup for protein remote homology detection. Conclusion: Our word correlation approach provides highly competitive performance as compared with state-of-the-art methods for protein remote homology detection. The learned model is interpretable in terms of biolo...

Thomas Lingner, Peter Meinicke

Real-time Traffic

BMCBI 2008 | Protein Remote Homology | Protein Sequences | Sequences |

claim paper

Related Content

» A Discriminative Framework for Detecting Remote Protein Homologies

» A Novel Approach to Remote Homology Detection Jumping Alignments

» Application of nonnegative matrix factorization to improve profileprofile alignment featur...

» A discriminative method for protein remote homology detection and fold recognition combini...

» Protein structure similarity from principle component correlation analysis

» Optimizing amino acid substitution matrices with a local alignment kernel

» ProtoMap automatic classification of protein sequences and hierarchy of protein families

» Clustering protein sequences with a novel metric transformed from sequence similarity scor...

» On the Role of Local Matching for Efficient Semisupervised Protein Sequence Classification

Post Info
More Details (n/a)

Added	09 Dec 2010
Updated	09 Dec 2010
Type	Journal
Year	2008
Where	BMCBI
Authors	Thomas Lingner, Peter Meinicke

Comments (0)