Sciweavers

1425 search results - page 75 / 285
» Similarity measures for multidimensional data
SIGMOD
2010
ACM
228views Database» more  SIGMOD 2010»
15 years 5 months ago
Probabilistic string similarity joins
Edit distance based string similarity join is a fundamental operator in string databases. Increasingly, many applications in data cleaning, data integration, and scientific compu...
Jeffrey Jestes, Feifei Li, Zhepeng Yan, Ke Yi
WSDM
2012
ACM
304views Data Mining» more  WSDM 2012»
13 years 8 months ago
Beyond co-occurrence: discovering and visualizing tag relationships from geo-spatial and temporal similarities
Studying relationships between keyword tags on social sharing websites has become a popular topic of research, both to improve tag suggestion systems and to discover connections b...
Haipeng Zhang, Mohammed Korayem, Erkang You, David...
SIGMETRICS
1998
ACM
15 years 4 months ago
Self-Similarity in File Systems
We demonstrate that high-level file system events exhibit self-similar behaviour, but only for short-term time scales of approximately under a day. We do so through the analysis of f...
Steven D. Gribble, Gurmeet Singh Manku, Drew S. Ro...
CN
2006
163views more  CN 2006»
15 years 18 days ago
A framework for mining evolving trends in Web data streams using dynamic learning and retrospective validation
The expanding and dynamic nature of the Web poses enormous challenges to most data mining techniques that try to extract patterns from Web data, such as Web usage and Web content....
Olfa Nasraoui, Carlos Rojas, Cesar Cardona
CSB
2005
IEEE
110views Bioinformatics» more  CSB 2005»
15 years 6 months ago
A Topological Measurement for Weighted Protein Interaction Network
High-throughput methods for detecting protein-protein interactions (PPI) have given researchers an initial global picture of protein interactions on a genomic scale. The usefulnes...
Pengjun Pei, Aidong Zhang