Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

172

CSB
2002
IEEE

109views Bioinformatics» more CSB 2002»

Towards Automatic Clustering of Protein Sequences

16 years 3 hour ago

Towards Automatic Clustering of Protein Sequences

Download www.cs.unc.edu

Analyzing protein sequence data becomes increasingly important recently. Most previous work on this area has mainly focused on building classiﬁcation models. In this paper, we investigate in the problem of automatic clustering of unlabeled protein sequences. As a widely recognized technique in statistics and computer science, clustering has been proven very useful in detecting unknown object categories and revealing hidden correlations among objects. One difﬁculty that prevents clustering from being performed directly on protein sequence is the lack of an effective similarity measure that can be computed efﬁciently. Therefore, we propose a novel model for protein sequence cluster by exploring signiﬁcant statistical properties possessed by the sequences. The concept of imprecise probabilities are introduced to the original probabilistic sufﬁx tree to monitor the convergence of the empirical measurement and to guide the clustering process. It has been demonstrated that the pro...

Jiong Yang, Wei Wang 0010

Real-time Traffic

Bioinformatics | CSB 2002 | Protein Sequence | Protein Sequence Data | Unlabeled Protein Sequences |

claim paper

Related Content

» ProtoMap automatic classification of protein sequences and hierarchy of protein families

» GeneRAGE a robust algorithm for sequence clustering and domain detection

» A functional hierarchical organization of the protein sequence space

» Automatic Protein Function Annotation through Candidate Ortholog Clusters from Incomplete ...

» EVEREST automatic identification and classification of protein domains in all protein sequ...

» Protein structure alignment using elastic shape analysis

» SCPS a fast implementation of a spectral method for detecting protein families on a genome...

» Super paramagnetic clustering of protein sequences

» Towards an automatic classification of protein structural domains based on structural simi...

Post Info
More Details (n/a)

Added	14 Jul 2010
Updated	14 Jul 2010
Type	Conference
Year	2002
Where	CSB
Authors	Jiong Yang, Wei Wang 0010

Comments (0)