Sciweavers

17688 search results - page 48 / 3538
» Data Set Balancing
Sort
View
110
Voted
KDD
2002
ACM
138views Data Mining» more  KDD 2002»
16 years 2 months ago
Learning to match and cluster large high-dimensional data sets for data integration
Part of the process of data integration is determining which sets of identifiers refer to the same real-world entities. In integrating databases found on the Web or obtained by us...
William W. Cohen, Jacob Richman
NCA
2007
IEEE
15 years 2 months ago
Handling of incomplete data sets using ICA and SOM in data mining
Based on independent component analysis (ICA) and self-organizing maps (SOM), this paper proposes an ISOM-DH model for the incomplete data’s handling in data mining. Under these ...
Hongyi Peng, Siming Zhu
138
Voted
MICRO
1994
IEEE
99views Hardware» more  MICRO 1994»
15 years 6 months ago
Data relocation and prefetching for programs with large data sets
Numerical applications frequently contain nested loop structures that process large arrays of data. The execution of these loop structures often produces memory preference pattern...
Yoji Yamada, John Gyllenhall, Grant Haab, Wen-mei ...
118
Voted
DIS
2006
Springer
15 years 6 months ago
On Class Visualisation for High Dimensional Data: Exploring Scientific Data Sets
Parametric Embedding (PE) has recently been proposed as a general-purpose algorithm for class visualisation. It takes class posteriors produced by a mixture-based clustering algori...
Ata Kabán, Jianyong Sun, Somak Raychaudhury...
130
Voted
BERTINORO
2005
Springer
15 years 8 months ago
Multifaceted Simultaneous Load Balancing in DHT-Based P2P Systems: A New Game with Old Balls and Bins
In this paper we present and evaluate uncoordinated on-line algorithms for simultaneous storage and replication load-balancing in DHT-based peer-to-peer systems. We compare our ap...
Karl Aberer, Anwitaman Datta, Manfred Hauswirth