Sciweavers

ALMOB
2006

Pattern statistics on Markov chains and sensitivity to parameter estimation

14 years 25 days ago
Pattern statistics on Markov chains and sensitivity to parameter estimation
Background: In order to compute pattern statistics in computational biology a Markov model is commonly used to take into account the sequence composition. Usually its parameter must be estimated. The aim of this paper is to determine how sensitive these statistics are to parameter estimation, and what are the consequences of this variability on pattern studies (finding the most over-represented words in a genome, the most significant common words to a set of sequences,...). Results: In the particular case where pattern statistics (overlap counting only) computed through binomial approximations we use the delta-method to give an explicit expression of , the standard deviation of a pattern statistic. This result is validated using simulations and a simple pattern study is also considered. Conclusion: We establish that the use of high order Markov model could easily lead to major mistakes due to the high sensitivity of pattern statistics to parameter estimation. Background In order to st...
Grégory Nuel
Added 10 Dec 2010
Updated 10 Dec 2010
Type Journal
Year 2006
Where ALMOB
Authors Grégory Nuel
Comments (0)