Automatic detection of false annotations via binary property clustering

15 years 4 months ago

Download www.biomedcentral.com

Background: Computational protein annotation methods occasionally introduce errors. Falsepositive (FP) errors are annotations that are mistakenly associated with a protein. Such false annotations introduce errors that may spread into databases through similarity with other proteins. Generally, methods used to minimize the chance for FPs result in decreased sensitivity or low throughput. We present a novel protein-clustering method that enables automatic separation of FP from true hits. The method quantifies the biological similarity between pairs of proteins by examining each protein's annotations, and then proceeds by clustering sets of proteins that received similar annotation into biological groups. Results: Using a test set of all PROSITE signatures that are marked as FPs, we show that the method successfully separates FPs in 69% of the 327 test cases supplied by PROSITE. Furthermore, we constructed an extensive random FP simulation test and show a high degree of success in d...

Noam Kaplan, Michal Linial

Real-time Traffic

Annotation | BMCBI 2005 | Computational Protein Annotation | Proteins |

claim paper

Post Info
More Details (n/a)

Added	15 Dec 2010
Updated	15 Dec 2010
Type	Journal
Year	2005
Where	BMCBI
Authors	Noam Kaplan, Michal Linial

Comments (0)

Sciweavers

Automatic detection of false annotations via binary property clustering

Annotation | BMCBI 2005 | Computational Protein Annotation | Proteins |

Explore & Download

Productivity Tools

Sciweavers