Sciweavers

RECOMB
2006
Springer

A Sublinear-Time Randomized Approximation Scheme for the Robinson-Foulds Metric

14 years 4 months ago
A Sublinear-Time Randomized Approximation Scheme for the Robinson-Foulds Metric
The Robinson-Foulds (RF) metric is the measure most widely used in comparing phylogenetic trees; it can be computed in linear time using Day's algorithm. When faced with the need to compare large numbers of large trees, however, even linear time becomes prohibitive. We present a randomized approximation scheme that provides, with high probability, a (1 + ) approximation of the true RF metric for all pairs of trees in a given collection. Our approach is to use a sublinear-space embedding of the trees, combined with an application of the Johnson-Lindenstrauss lemma to approximate vector norms very rapidly. We discuss the consequences of various parameter choices (in the embedding and in the approximation requirements). We also implemented our algorithm as a Java class that can easily be combined with popular packages such as Mesquite; in consequence, we present experimental results illustrating the precision and running-time tradeoffs as well as demonstrating the speed of our approa...
Nicholas D. Pattengale, Bernard M. E. Moret
Added 03 Dec 2009
Updated 03 Dec 2009
Type Conference
Year 2006
Where RECOMB
Authors Nicholas D. Pattengale, Bernard M. E. Moret
Comments (0)