Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

43

PADL
2009
Springer

favoriteEmaildiscussreport

591views Programming Languages» more PADL 2009»

Using Bloom Filters for Large Scale Gene Sequence Analysis in Haskell

14 years 5 months ago

Using Bloom Filters for Large Scale Gene Sequence Analysis in Haskell

Download www.serpentine.com

Analysis of biological data often involves large data sets and computationally expensive algorithms. Databases of biological data continue to grow, leading to an increasing demand for improved algorithms and data structures. Despite having many advantages over more traditional indexing structures, the Bloom filter is almost unused in bioinformatics. Here we present a robust and efficient Bloom filter implementation in Haskell, and implement a simple bioinformatics application for indexing and matching sequence data. We use this to index the chromosomes that make up the human genome, and map all available gene sequences to it. Our experiences with developing and tuning our application suggest that for bioinformatics applications, Haskell offers a compelling combination of rapid development, quality assurance, and high performance.

Ketil Malde, Bryan O'Sullivan

Real-time Traffic

Biological Data | Efficient Bloom Filter | PADL 2009 | Programming Languages | Simple Bioinformatics Application |

claim paper

Related Content

» Gene models from ESTs GeneModelEST an application on the Solanum lycopersicum genome

» Gene prediction in metagenomic fragments A large scale machine learning approach

» NOTUNG A Program for Dating Gene Duplications and Optimizing Gene Family Trees

» Proteinortho Detection of CoOrthologs in LargeScale Analysis

» PSAT A web tool to compare genomic neighborhoods of multiple prokaryotic genomes

» CGKB an annotation knowledge base for cowpea Vigna unguiculata L methylation filtered geno...

» Decoding Synteny Blocks and LargeScale Duplications in Mammalian and Plant Genomes

» Washing scaling of GeneChip microarray expression

» Enrichment of homologs in insignificant BLAST hits by cocomplex network alignment

Post Info
More Details (n/a)

Added	22 Nov 2009
Updated	22 Nov 2009
Type	Conference
Year	2009
Where	PADL
Authors	Ketil Malde, Bryan O'Sullivan

Comments (0)