Sciweavers

DCC
2007
IEEE
14 years 4 months ago
A Simple Statistical Algorithm for Biological Sequence Compression
This paper introduces a novel algorithm for biological sequence compression that makes use of both statistical properties and repetition within sequences. A panel of experts is ma...
Minh Duc Cao, Trevor I. Dix, Lloyd Allison, Chris ...
RECOMB
2006
Springer
14 years 5 months ago
Alignment Statistics for Long-Range Correlated Genomic Sequences
It is well known that the base composition along eukaryotic genomes is long-range correlated. Here, we investigate the effect of such long-range correlations on alignment score sta...
Philipp W. Messer, Ralf Bundschuh, Martin Vingron,...
ALMOB
2007
13 years 4 months ago
Local sequence alignments statistics: deviations from Gumbel statistics in the rare-event tail
Background: The optimal score for ungapped local alignments of infinitely long random sequences is known to follow a Gumbel extreme value distribution. Less is known about the imp...
Stefan Wolfsheimer, Bernd Burghardt, Alexander K. ...
CSB
2003
IEEE
13 years 10 months ago
A Block Coding Method that Leads to Significantly Lower Entropy Values for the Proteins and Coding Sections of Haemophilus influ
A simple statistical block code in combination with the LZW-based compression utilities gzip and compress has been found to increase by a significant amount the level of compressi...
G. Sampath
BIBE
2010
IEEE
12 years 11 months ago
Compressed q-Gram Indexing for Highly Repetitive Biological Sequences
The study of compressed storage schemes for highly repetitive sequence collections has been recently boosted by the availability of cheaper sequencing technologies and the flood of...
Francisco Claude, Antonio Fariña, Miguel A....