Sciweavers

JCB
2007

Effects of Long-Range Correlations in DNA on Sequence Alignment Score Statistics

13 years 4 months ago
Effects of Long-Range Correlations in DNA on Sequence Alignment Score Statistics
Long-range correlations in genomic base composition are a ubiquitous statistical feature among many eukaryotic genomes. In this article, these correlations are shown to substantially influence the statistics of sequence alignment scores. Using a Gaussian approximation to model the correlated score landscape, we calculate the corrections to the scale parameter of the extreme value distribution of alignment scores. Our approximate analytic results are supported by a detailed numerical study based on a simple algorithm to efficiently generate long-range correlated random sequences. We find both, mean and exponential tail of the score distribution for long-range correlated sequences to be substantially shifted compared to random sequences with independent nucleotides. The significance of measured alignment scores will therefore change upon incorporation of the correlations in the null model. We discuss the magnitude of this effect in a biological context. Key words: extreme value dist...
Philipp W. Messer, Ralf Bundschuh, Martin Vingron,
Added 15 Dec 2010
Updated 15 Dec 2010
Type Journal
Year 2007
Where JCB
Authors Philipp W. Messer, Ralf Bundschuh, Martin Vingron, Peter F. Arndt
Comments (0)