Sciweavers

ISMB
1996

Discovering Patterns and Subfamilies in Biosequences

We consider the problem of automatic discovery of patterns and the corresponding subfamilies in a set of biosequences. The sequences are unaligned and may contain noise of unknown level. The patterns are of the type used in PROSITE database. In our approach we discover patterns and the respective subfamilies simultaneously. We develop a theoretically substantiated significance measure for a set of such patterns and an algorithm approximating the best pattern set and the subfamilies. The approach is based on the minimum description length (MDL) principle. We report a computing experiment correctly finding subfamilies in the family of chromodomains and revealing new strong patterns.
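To make the notions of a PROSITE-type pattern and a subfamily concrete, here is a minimal Python sketch. It is not the algorithm from the paper: it does no pattern discovery and no MDL scoring. It only assumes a small subset of PROSITE syntax (single residues, `x` wildcards, `[...]` and `{...}` groups, and `(n)` or `(n,m)` repeats) and treats a subfamily as the set of sequences matched by one pattern. The function names, the toy patterns, and the toy sequences are all hypothetical.

```python
import re


def prosite_to_regex(pattern: str) -> str:
    """Translate a subset of PROSITE pattern syntax into a Python regex string."""
    parts = []
    for element in pattern.rstrip(".").split("-"):
        # Split off an optional repeat suffix such as (3) or (2,4).
        m = re.fullmatch(r"(.+?)(?:\((\d+)(?:,(\d+))?\))?", element)
        core, lo, hi = m.group(1), m.group(2), m.group(3)
        if core == "x":
            token = "."                      # wildcard: any residue
        elif core.startswith("{") and core.endswith("}"):
            token = "[^" + core[1:-1] + "]"  # any residue except those listed
        else:
            token = core                     # single residue or [ABC] group
        if lo is not None:
            token += "{" + lo + ("," + hi if hi else "") + "}"
        parts.append(token)
    return "".join(parts)


def assign_subfamilies(sequences, patterns):
    """Group each sequence under the first pattern that matches it.

    Sequences matching no pattern are returned separately; in this toy
    setting they stand in for the 'noise' mentioned in the abstract.
    """
    compiled = [re.compile(prosite_to_regex(p)) for p in patterns]
    groups = {p: [] for p in patterns}
    unmatched = []
    for seq in sequences:
        for p, rx in zip(patterns, compiled):
            if rx.search(seq):
                groups[p].append(seq)
                break
        else:
            unmatched.append(seq)
    return groups, unmatched


# Hypothetical toy data, not taken from the paper or from PROSITE.
toy_patterns = ["C-x(2)-C", "H-x(3)-H"]
toy_sequences = ["AACGGCAA", "TTHAAAHTT", "MMMMMM"]
groups, noise = assign_subfamilies(toy_sequences, toy_patterns)
```

In the paper's setting the patterns are not given in advance; the sketch only shows how a fixed pattern set induces a partition of the sequences into subfamilies plus unexplained sequences.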
Added 02 Nov 2010
Updated 02 Nov 2010
Type Conference
Year 1996
Where ISMB
Authors Alvis Brazma, Inge Jonassen, Esko Ukkonen, Jaak Vilo