ICDM
2005
IEEE
13 years 10 months ago
Stability of Feature Selection Algorithms
With the proliferation of extremely high-dimensional data, feature selection algorithms have become indispensable components of the learning process. Strangely, despite extensive ...
Alexandros Kalousis, Julien Prados, Melanie Hilari...
ICDM
2005
IEEE
13 years 10 months ago
An Algorithm for In-Core Frequent Itemset Mining on Streaming Data
Frequent itemset mining is a core data mining operation and has been extensively studied over the last decade. This paper takes a new approach for this problem and makes two major...
Ruoming Jin, Gagan Agrawal
ICDM
2005
IEEE
13 years 10 months ago
A Scalable Collaborative Filtering Framework Based on Co-Clustering
Collaborative filtering-based recommender systems, which automatically predict preferred products of a user using known preferences of other users, have become extremely popular ...
Thomas George, Srujana Merugu
ICDM
2005
IEEE
13 years 10 months ago
CoLe: A Cooperative Data Mining Approach and Its Application to Early Diabetes Detection
We present CoLe, a cooperative data mining approach for discovering hybrid knowledge. It employs multiple different data mining algorithms, and combines results from them to enhan...
Jie Gao, Jörg Denzinger, Robert C. James
ICDM
2005
IEEE
13 years 10 months ago
Privacy-Preserving Frequent Pattern Mining across Private Databases
Privacy consideration has much significance in the application of data mining. It is very important that the privacy of individual parties will not be exposed when data mining te...
Ada Wai-Chee Fu, Raymond Chi-Wing Wong, Ke Wang
ICDM
2005
IEEE
13 years 10 months ago
Labeling Unclustered Categorical Data into Clusters Based on the Important Attribute Values
Sampling has been recognized as an important technique to improve the efficiency of clustering. However, with sampling applied, those points which are not sampled will not have t...
Hung-Leng Chen, Kun-Ta Chuang, Ming-Syan Chen
ICDM
2005
IEEE
13 years 10 months ago
A Computational Framework for Taxonomic Research: Diagnosing Body Shape within Fish Species Complexes
It is estimated that ninety percent of the world’s species have yet to be discovered and described. The main reason for the slow pace of new species description is that the scie...
Yixin Chen, Henry L. Bart Jr., Shuqing Huang, Huim...
ICDM
2005
IEEE
13 years 10 months ago
Effective Estimation of Posterior Probabilities: Explaining the Accuracy of Randomized Decision Tree Approaches
There has been an increasing number of independently proposed randomization methods in different stages of decision tree construction to build multiple trees. Randomized decision tre...
Wei Fan, Ed Greengrass, Joe McCloskey, Philip S. Y...
ICDM
2005
IEEE
13 years 10 months ago
Summarization - Compressing Data into an Informative Representation
In this paper, we formulate the problem of summarization of a dataset of transactions with categorical attributes as an optimization problem involving two objective functions - co...
Varun Chandola, Vipin Kumar
ICDM
2005
IEEE
13 years 10 months ago
Segment-Based Injection Attacks against Collaborative Filtering Recommender Systems
Significant vulnerabilities have recently been identified in collaborative filtering recommender systems. Researchers have shown that attackers can manipulate a system’s reco...
Robin D. Burke, Bamshad Mobasher, Runa Bhaumik, Ch...