Fast Protein Superfamily Classification Using Principal Component Null Space Analysis

15 years 7 months ago

Download www.cs.uwindsor.ca

Abstract. The protein family classification problem, which consists of determining the family memberships of given unknown protein sequences, is very important for a biologist for many practical reasons, such as drug discovery, prediction of molecular functions and medical diagnosis. Neural networks and bayesian methods have performed well on the protein classification problem, achieving accuracy ranging from 90% to 98% while running relatively slow in the learning stage. In this paper, we present a principal component null space analysis (PCNSA) linear classifier to the problem and report excellent results compared to those of neural networks and support vector machines. The two main parameters of PCNSA are linked to the high dimensionality of the dataset used, and were optimized in an exhaustive manner to maximize accuracy.

Leon French, Alioune Ngom, Luis Rueda

Real-time Traffic

AI 2005 | Artificial Intelligence | Classification Problem | Neural Networks | Protein Family Classification |

claim paper

» Feature selection and nearest centroid classification for protein mass spectrometry

» An improved classification of Gproteincoupled receptors using sequencederived features

» Accelerated Kernel Feature Analysis

Post Info
More Details (n/a)

Added	15 Dec 2010
Updated	15 Dec 2010
Type	Journal
Year	2005
Where	AI
Authors	Leon French, Alioune Ngom, Luis Rueda

Comments (0)

Sciweavers

Fast Protein Superfamily Classification Using Principal Component Null Space Analysis

AI 2005 | Artificial Intelligence | Classification Problem | Neural Networks | Protein Family Classification |

Explore & Download

Productivity Tools

Sciweavers