Sciweavers

BMCBI
2002

Making sense of EST sequences by CLOBBing them

13 years 4 months ago
Making sense of EST sequences by CLOBBing them
Background: Expressed sequence tags (ESTs) are single pass reads from randomly selected cDNA clones. They provide a highly cost-effective method to access and identify expressed genes. However, they are often prone to sequencing errors and typically define incomplete transcripts. To increase the amount of information obtainable from ESTs and reduce sequencing errors, it is necessary to cluster ESTs into groups sharing significant sequence similarity. Results: As part of our ongoing EST programs investigating 'orphan' genomes, we have developed a clustering algorithm, CLOBB (Cluster on the basis of BLAST similarity) to identify and cluster ESTs. CLOBB may be used incrementally, preserving original cluster designations. It tracks clusterspecific events such as merging, identifies 'superclusters' of related clusters and avoids the expansion of chimeric clusters. Based on the Perl scripting language, CLOBB is highly portable relying only on a local installation of NCBI...
John Parkinson, David B. Guiliano, Mark L. Blaxter
Added 17 Dec 2010
Updated 17 Dec 2010
Type Journal
Year 2002
Where BMCBI
Authors John Parkinson, David B. Guiliano, Mark L. Blaxter
Comments (0)