Sciweavers

17 search results - page 2 / 4
» Finding highly correlated pairs efficiently with powerful pr...
KDD
2008
ACM
Data Mining
FastANOVA: an efficient algorithm for genome-wide association study
Studying the association between quantitative phenotype (such as height or weight) and single nucleotide polymorphisms (SNPs) is an important problem in biology. To understand und...
Xiang Zhang, Fei Zou, Wei Wang 0010
ICDE
2006
IEEE
Database
CLAN: An Algorithm for Mining Closed Cliques from Large Dense Graph Databases
Most previously proposed frequent graph mining algorithms are intended to find the complete set of all frequent, closed subgraphs. However, in many cases only a subset of the freq...
Jianyong Wang, Zhiping Zeng, Lizhu Zhou
PVLDB
2010
Trie-Join: Efficient Trie-based String Similarity Joins with Edit-Distance Constraints
A string similarity join finds similar pairs between two collections of strings. It is an essential operation in many applications, such as data integration and cleaning, and has ...
Jiannan Wang, Guoliang Li, Jianhua Feng
BMCBI
2010
Revealing and avoiding bias in semantic similarity scores for protein pairs
Background: Semantic similarity scores for protein pairs are widely applied in functional genomic researches for finding functional clusters of proteins, predicting protein functi...
Jing Wang 0004, Xianxiao Zhou, Jing Zhu, Chenggui ...
ADBIS
2009
Springer
Database
Efficient Set Similarity Joins Using Min-prefixes
Identification of all objects in a dataset whose similarity is not less than a specified threshold is of major importance for management, search, and analysis of data. Set similari...
Leonardo Ribeiro, Theo Härder