Clustering protein sequences with a novel metric transformed from sequence similarity scores and sequence alignments with neural

Background: The sequencing of the human genome has enabled us to access a comprehensive list of genes (both experimental and predicted) for further analysis. While a majority of the approximately 30,000 known and predicted human coding genes are characterized and have been assigned at least one function, there remains a fair number of genes (about 12,000) for which no annotation has been made. The recent sequencing of other genomes has provided us with a huge amount of auxiliary sequence data which could help in the characterization of the human genes. Clustering these sequences into families is one of the first steps to perform comparative studies across several genomes.

Results: Here we report a novel clustering algorithm (CLUGEN) that has been used to cluster sequences of experimentally verified and predicted proteins from all sequenced genomes using a novel distance metric which is a neural network score between a pair of protein sequences. This distance metric is based on the pairw...

Qicheng Ma, Gung-Wei Chirn, Richard Cai, Joseph D.

BMCBI 2005 | Genomes | Protein | Protein Sequences |

» Connect the dots exposing hidden protein family connections from the entire sequence tree

» The distanceprofile representation and its application to detection of distantly related p...

» BackTranslation for Discovering Distant Protein Homologies

» Clustered Sequence Representation for Fast Homology Search

» PFAAT version 20 A tool for editing annotating and analyzing multiple sequence alignments

» Degenerate Primer Design via Clustering

» Structure alignment based on coding of local geometric measures

» Integrative network alignment reveals large regions of global network similarity in yeast ...

Post Info

Added	15 Dec 2010
Updated	15 Dec 2010
Type	Journal
Year	2005
Where	BMCBI
Authors	Qicheng Ma, Gung-Wei Chirn, Richard Cai, Joseph D. Szustakowski, N. R. Nirmala
