Sciweavers

BMCBI
2010

Predicting conserved protein motifs with Sub-HMMs

12 years 11 months ago
Predicting conserved protein motifs with Sub-HMMs
Background: Profile HMMs (hidden Markov models) provide effective methods for modeling the conserved regions of protein families. A limitation of the resulting domain models is the difficulty to pinpoint their much shorter functional sub-features, such as catalytically relevant sequence motifs in enzymes or ligand binding signatures of receptor proteins. Results: To identify these conserved motifs efficiently, we propose a method for extracting the most information-rich regions in protein families from their profile HMMs. The method was used here to predict a comprehensive set of subHMMs from the Pfam domain database. Cross-validations with the PROSITE and CSA databases confirmed the efficiency of the method in predicting most of the known functionally relevant motifs and residues. At the same time, 46,768 novel conserved regions could be predicted. The data set also allowed us to link at least 461 Pfam domains of known and unknown function by their common sub-HMMs. Finally, the sub-H...
Kevin Horan, Christian R. Shelton, Thomas Girke
Added 12 May 2011
Updated 12 May 2011
Type Journal
Year 2010
Where BMCBI
Authors Kevin Horan, Christian R. Shelton, Thomas Girke
Comments (0)