Sciweavers

ISMB
2001

Using mixtures of common ancestors for estimating the probabilities of discrete events in biological sequences

14 years 25 days ago
Using mixtures of common ancestors for estimating the probabilities of discrete events in biological sequences
Accurately estimating probabilities from observations is important for probabilistic-based approaches to problems in computational biology. In this paper we present a biologically-motivated method for estimating probability distributions over discrete alphabets from observations using a mixture model of common ancestors. The method is an extension of substitution matrix-based probability estimation methods. In contrast to previous substitution matrix-based methods, our method has a simple Bayesian interpretation. The method presented in this paper has the advantage over Dirichlet mixtures that it is both effective and simple to compute for large alphabets. The method is applied to estimate amino acid probabilities based on observed counts in an alignment and is shown to perform comparable to previous methods. The method is also applied to estimate probability distributions over protein families and improves protein classification accuracy.
Eleazar Eskin, William Noble Grundy, Yoram Singer
Added 31 Oct 2010
Updated 31 Oct 2010
Type Conference
Year 2001
Where ISMB
Authors Eleazar Eskin, William Noble Grundy, Yoram Singer
Comments (0)