Context sensitive vocabulary and its application in protein secondary structure prediction

16 years 2 days ago

Download www.cs.cmu.edu

Protein secondary structure prediction is an important step towards understanding the relation between protein sequence and structure. However, most current prediction methods use features diﬃcult for biologists to interpret. In this paper, we present a new method that applies information retrieval techniques to solve the problem: we extract a context sensitive biological vocabulary for protein sequences and apply text classiﬁcation methods to predict protein secondary structure. Experimental results show that our method performs comparably to the state-of-art methods. Furthermore, the context sensitive vocabularies can serve as a useful tool to discover meaningful regular expression patterns for protein structures. Categories and Subject Descriptors: H.4.4 [Information Systems Applications]: Miscellaneous General Terms: Algorithms, Experimentation.

Yan Liu, Jaime G. Carbonell, Judith Klein-Seethara

Real-time Traffic

Protein Secondary Structure | Protein Sequences | Secondary Structure Prediction | SIGIR 2004 |

claim paper

» CMASA an accurate algorithm for detecting local protein structural similarity and its appl...

» Predicting residuewise contact orders in proteins by support vector regression

» Application of amino acid occurrence for discriminating different folding types of globula...

» Linear predictive coding representation of correlated mutation for protein sequence alignm...

» 4SALE A tool for synchronous RNA sequence and secondary structure alignment and editing

» Detailed estimation of bioinformatics prediction reliability through the Fragmented Predic...

» Validation of protein models by a neural network approach

» A new protein binding pocket similarity measure based on comparison of clouds of atoms in ...

Post Info
More Details (n/a)

Added	30 Jun 2010
Updated	30 Jun 2010
Type	Conference
Year	2004
Where	SIGIR
Authors	Yan Liu, Jaime G. Carbonell, Judith Klein-Seetharaman, Vanathi Gopalakrishnan

Comments (0)

Sciweavers

Context sensitive vocabulary and its application in protein secondary structure prediction

Protein Secondary Structure | Protein Sequences | Secondary Structure Prediction | SIGIR 2004 |

Explore & Download

Productivity Tools

Sciweavers