Sciweavers

ICDE
2004
IEEE

Unordered Tree Mining with Applications to Phylogeny

14 years 5 months ago
Unordered Tree Mining with Applications to Phylogeny
Frequent structure mining (FSM) aims to discover and extract patterns frequently occurring in structural data, such as trees and graphs. FSM finds many applications in bioinformatics, XML processing, Web log analysis, and so on. In this paper we present a new FSM technique for finding patterns in rooted unordered labeled trees. The patterns of interest are cousin pairs in these trees. A cousin pair is a pair of nodes sharing the same parent, the same grandparent, or the same great-grandparent, etc. Given a tree ? , our algorithm finds all interesting cousin pairs of ? in?????? ??? time where ?? ? is the number of nodes in ? . Experimental results on synthetic data and phylogenies show the scalability and effectiveness of the proposed technique. To demonstrate the usefulness of our approach, we discuss its applications to locating co-occurring patterns in multiple evolutionary trees, evaluating the consensus of equally parsimonious trees, and finding kernel trees of groups of phylogeni...
Dennis Shasha, Jason Tsong-Li Wang, Sen Zhang
Added 01 Nov 2009
Updated 01 Nov 2009
Type Conference
Year 2004
Where ICDE
Authors Dennis Shasha, Jason Tsong-Li Wang, Sen Zhang
Comments (0)