BIBE
2007
IEEE

A New Alignment-Independent Algorithm for Clustering Protein Sequences

The rapid growth of available protein data makes clustering within protein families increasingly important; the challenge is to identify subfamilies of evolutionarily related sequences. This identification reveals phylogenetic relationships, which provide prior knowledge to help researchers understand biological phenomena. A good evolutionary model is essential to achieve a clustering that reflects biological reality, and an accurate estimate of protein sequence similarity is crucial to building such a model. Most existing algorithms estimate this similarity using techniques that are not necessarily biologically plausible, especially for hard-to-align sequences such as multi-domain, circular-permutation and tandem-repeat protein sequences, which cause many difficulties for alignment-dependent algorithms. In this paper, we propose a novel similarity measure based on matching amino acid subsequences. This measure, named SMS for Substitution Matching S...
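To illustrate the general idea of an alignment-independent similarity, the sketch below compares two sequences by the short subsequences (k-mers) they share. This is not the SMS measure from the paper, whose definition is cut off in the abstract above; it is a simple Jaccard score over k-mer sets, and the function names, the choice of k = 3, and the example sequence are all assumptions made for illustration.

```python
def kmers(seq, k=3):
    """Return the set of all contiguous length-k subsequences of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def kmer_similarity(a, b, k=3):
    """Jaccard similarity of the k-mer sets of a and b, in [0, 1].

    Illustrative stand-in only; NOT the SMS measure proposed in the paper.
    """
    ka, kb = kmers(a, k), kmers(b, k)
    if not ka and not kb:
        # Both sequences are shorter than k: no evidence of similarity.
        return 0.0
    return len(ka & kb) / len(ka | kb)


if __name__ == "__main__":
    seq = "MKTAYIAKQR"               # hypothetical toy sequence
    permuted = seq[5:] + seq[:5]     # circular permutation of seq
    print(kmer_similarity(seq, seq))
    print(kmer_similarity(seq, permuted))
```

Because the score depends only on which subsequences occur and not on where they occur, a circular permutation of a sequence still shares most of its k-mers with the original, which is the kind of case the abstract describes as difficult for alignment-dependent methods.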
Abdellali Kelil, Shengrui Wang, Ryszard Brzezinski
Added 02 Jun 2010
Updated 02 Jun 2010
Type Conference
Year 2007
Where BIBE