The similarity metric

15 years 5 months ago

Download homepages.cwi.nl

—A new class of distances appropriate for measuring similarity relations between sequences, say one type of similarity per distance, is studied. We propose a new “normalized information distance,” based on the noncomputable notion of Kolmogorov complexity, and show that it is in this class and it minorizes every computable distance in the class (that is, it is universal in that it discovers all computable similarities). We demonstrate that it is a metric and call it the similarity metric. This theory forms the foundation for a new practical tool. To evidence generality and robustness, we give two distinctive applications in widely divergent areas using standard compression programs like gzip and GenCompress. First, we compare whole mitochondrial genomes and infer their evolutionary history. This results in a ﬁrst completely automatic computed whole mitochondrial phylogeny tree. Secondly, we fully automatically compute the language tree of 52 different languages.

Ming Li, Xin Chen, Xin Li, Bin Ma, Paul M. B. Vit&

Real-time Traffic

Algorithms | Computable Distance | Normalized Information Distance | Similarity Relations | SODA 2003 |

claim paper

» Density geodesics for similarity clustering

» Pivoting Mtree A Metric Access Method for Efficient Similarity Search

» Combining Multiple Similarity Metrics Using a Multicriteria Approach

» An Improved SVM Based on Similarity Metric

» A ContentAddressable Network for Similarity Search in Metric Spaces

» PotentialBased Hierarchical Clustering

» On Fuzzy vs Metric Similarity Search in Complex Databases

» New Area MatrixBased AffineInvariant Shape Features and Similarity Metrics

» Employing Trainable String Similarity Metrics for Information Integration

Post Info
More Details (n/a)

Added	01 Nov 2010
Updated	01 Nov 2010
Type	Conference
Year	2003
Where	SODA
Authors	Ming Li, Xin Chen, Xin Li, Bin Ma, Paul M. B. Vitányi

Comments (0)

Sciweavers

The similarity metric

Algorithms | Computable Distance | Normalized Information Distance | Similarity Relations | SODA 2003 |

Explore & Download

Productivity Tools

Sciweavers