Sciweavers

17688 search results - page 4 / 3538
» Data Set Balancing
Sort
View
SDM
2010
SIAM
184views Data Mining» more  SDM 2010»
14 years 11 months ago
A Robust Decision Tree Algorithm for Imbalanced Data Sets
We propose a new decision tree algorithm, Class Confidence Proportion Decision Tree (CCPDT), which is robust and insensitive to class distribution and generates rules which are st...
Wei Liu, Sanjay Chawla, David A. Cieslak, Nitesh V...
120
Voted
ICDE
2012
IEEE
216views Database» more  ICDE 2012»
12 years 12 months ago
Load Balancing in MapReduce Based on Scalable Cardinality Estimates
—MapReduce has emerged as a popular tool for distributed and scalable processing of massive data sets and is increasingly being used in e-science applications. Unfortunately, the...
Benjamin Gufler, Nikolaus Augsten, Angelika Reiser...
DOLAP
1998
ACM
15 years 1 months ago
Dynamic Maintenance of Multidimensional Range Data Partitioning for Parallel Data Processing
Star schema has been a typical model for both online transaction processing in traditional databases and online analytical processing in large data warehouses. In the star schema,...
Junping Sun, William I. Grosky
88
Voted
ISCA
2006
IEEE
169views Hardware» more  ISCA 2006»
15 years 3 months ago
Balanced Cache: Reducing Conflict Misses of Direct-Mapped Caches
Level one cache normally resides on a processor’s critical path, which determines the clock frequency. Directmapped caches exhibit fast access time but poor hit rates compared w...
Chuanjun Zhang
BIB
2006
69views more  BIB 2006»
14 years 9 months ago
Flux balance analysis in the era of metabolomics
Flux balance analysis (FBA) has emerged as an effective means to analyse biological networks in a quantitative manner. Much progress has been made on the extension of FBA to incor...
Jong Min Lee, Erwin P. Gianchandani, Jason A. Papi...