Sciweavers

IDEAL
2005
Springer

Exploiting Sequence Dependencies in the Prediction of Peroxisomal Proteins

14 years 5 months ago
Exploiting Sequence Dependencies in the Prediction of Peroxisomal Proteins
Prediction of peroxisomal matrix proteins generally depends on the presence of one of two distinct motifs at the end of the amino acid sequence. PTS1 peroxisomal proteins have a well conserved tripeptide at the C-terminal end. However, the preceding residues in the sequence arguably play a crucial role in targeting the protein to the peroxisome. Previous work in applying machine learning to the prediction of peroxisomal matrix proteins has failed to capitalize on the full extent of these dependencies. We benchmark a range of machine learning algorithms, and show that a classifier – based on the Support Vector Machine – produces more accurate results when dependencies between the conserved motif and the preceding section are exploited. We publish an updated and rigorously curated data set that results in increased prediction accuracy of most tested models.
Mark Wakabayashi, John Hawkins, Stefan Maetschke,
Added 27 Jun 2010
Updated 27 Jun 2010
Type Conference
Year 2005
Where IDEAL
Authors Mark Wakabayashi, John Hawkins, Stefan Maetschke, Mikael Bodén
Comments (0)