Sciweavers

89 search results - page 3 / 18
» Exploiting Dataset Similarity for Distributed Mining
DBISP2P
2008
Springer
Exploiting Distribution Skew for Scalable P2P Text Clustering
K-Means clustering is widely used in information retrieval and data mining. Distributed K-Means variants have already been proposed, but none of the past algorithms scales to large...
Odysseas Papapetrou, Wolf Siberski, Fabian Leitrit...
PKDD
2010
Springer
Cross Validation Framework to Choose amongst Models and Datasets for Transfer Learning
One solution to the lack-of-label problem is to exploit transfer learning, whereby one acquires knowledge from source-domains to improve the learning performance in the t...
ErHeng Zhong, Wei Fan, Qiang Yang, Olivier Versche...
SDM
2009
SIAM
Discovery of Geospatial Discriminating Patterns from Remote Sensing Datasets
Large amounts of remotely sensed data call for data mining techniques to fully utilize their rich information content. In this paper, we study new means of discovery and summariz...
Wei Ding 0003, Tomasz F. Stepinski, Josue Salazar
ICDM
2003
IEEE
Exploiting Unlabeled Data for Improving Accuracy of Predictive Data Mining
Predictive data mining typically relies on labeled data without exploiting a much larger amount of available unlabeled data. The goal of this paper is to show that using unlabeled...
Kang Peng, Slobodan Vucetic, Bo Han, Hongbo Xie, Z...
COLING
2010
Corpus-based Semantic Class Mining: Distributional vs. Pattern-Based Approaches
Main approaches to corpus-based semantic class mining include distributional similarity (DS) and pattern-based (PB). In this paper, we perform an empirical comparison of them, bas...
Shuming Shi, Huibin Zhang, Xiaojie Yuan, Ji-Rong W...