Sciweavers

947 search results - page 98 / 190
» Evaluation of Sampling for Data Mining of Association Rules
ICEIS
2009
IEEE
15 years 7 months ago
Minable Data Warehouse
Data warehouses have been widely used in various capacities such as large corporations or public institutions. These systems contain large and rich datasets that are often used by ...
David Morgan, Jai W. Kang, James M. Kang
SDM
2010
SIAM
15 years 2 months ago
A Robust Decision Tree Algorithm for Imbalanced Data Sets
We propose a new decision tree algorithm, Class Confidence Proportion Decision Tree (CCPDT), which is robust and insensitive to class distribution and generates rules which are st...
Wei Liu, Sanjay Chawla, David A. Cieslak, Nitesh V...
ICDE
2003
IEEE
16 years 2 months ago
Similarity Search in Sets and Categorical Data Using the Signature Tree
Data mining applications analyze large collections of set data and high dimensional categorical data. Search on these data types is not restricted to the classic problems of minin...
Nikos Mamoulis, David W. Cheung, Wang Lian
CIBCB
2008
IEEE
15 years 7 months ago
Very large scale ReliefF for genome-wide association analysis
The genetic causes of many monogenic diseases have already been discovered. However, most common diseases are actually the result of complex nonlinear interactions between mult...
Margaret J. Eppstein, Paul Haake
KDD
2009
ACM
16 years 1 months ago
Efficiently learning the accuracy of labeling sources for selective sampling
Many scalable data mining tasks rely on active learning to provide the most useful accurately labeled instances. However, what if there are multiple labeling sources (`oracles...
Pinar Donmez, Jaime G. Carbonell, Jeff Schneider