Sciweavers

319 search results - page 22 / 64
» Algorithms for Mining Distance-Based Outliers in Large Datas...
Sort
View
BMCBI
2007
102views more  BMCBI 2007»
14 years 10 months ago
Setting up a large set of protein-ligand PDB complexes for the development and validation of knowledge-based docking algorithms
Background: The number of algorithms available to predict ligand-protein interactions is large and ever-increasing. The number of test cases used to validate these methods is usua...
Luis A. Diago, Persy Morell, Longendri Aguilera, E...
IDEAL
2003
Springer
15 years 3 months ago
Experiences of Using a Quantitative Approach for Mining Association Rules
In recent years interest has grown in “mining” large databases to extract novel and interesting information. Knowledge Discovery in Databases (KDD) has been recognised as an em...
L. Dong, Christos Tjortjis
SDM
2009
SIAM
114views Data Mining» more  SDM 2009»
15 years 7 months ago
GAD: General Activity Detection for Fast Clustering on Large Data.
In this paper, we propose GAD (General Activity Detection) for fast clustering on large scale data. Within this framework we design a set of algorithms for different scenarios: (...
Jiawei Han, Liangliang Cao, Sangkyum Kim, Xin Jin,...
AUSDM
2007
Springer
101views Data Mining» more  AUSDM 2007»
15 years 1 months ago
Exploratory Multilevel Hot Spot Analysis: Australian Taxation Office Case Study
Population based real-life datasets often contain smaller clusters of unusual sub-populations. While these clusters, called `hot spots', are small and sparse, they are usuall...
Denny, Graham J. Williams, Peter Christen
ICCS
2009
Springer
15 years 4 months ago
Frequent Itemset Mining for Clustering Near Duplicate Web Documents
A vast amount of documents in the Web have duplicates, which is a challenge for developing efficient methods that would compute clusters of similar documents. In this paper we use ...
Dmitry I. Ignatov, Sergei O. Kuznetsov