Sciweavers

ICDM
2006
IEEE
Active Learning to Maximize Area Under the ROC Curve
In active learning, a machine learning algorithm is given an unlabeled set of examples U, and is allowed to request labels for a relatively small subset of U to use for training. ...
Matt Culver, Kun Deng, Stephen D. Scott
ICDM
2006
IEEE
Optimal k-Anonymity with Flexible Generalization Schemes through Bottom-up Searching
In recent years, a major thread of research on k-anonymity has focused on developing more flexible generalization schemes that produce higher-quality datasets. In this paper we in...
Tiancheng Li, Ninghui Li
ICDM
2006
IEEE
P3C: A Robust Projected Clustering Algorithm
Gabriela Moise, Jörg Sander, Martin Ester
ICDM
2006
IEEE
Window-based Tensor Analysis on High-dimensional and Multi-aspect Streams
Data stream values are often associated with multiple aspects. For example, each value from environmental sensors may have an associated type (e.g., temperature, humidity, etc.) as...
Jimeng Sun, Spiros Papadimitriou, Philip S. Yu
ICDM
2006
IEEE
Cluster Ranking with an Application to Mining Mailbox Networks
Ziv Bar-Yossef, Ido Guy, Ronny Lempel, Yoëlle...
ICDM
2006
IEEE
Boosting for Learning Multiple Classes with Imbalanced Class Distribution
Classification of data with imbalanced class distribution significantly limits the performance attainable by most standard classifier learning algorithms, which ...
Yanmin Sun, Mohamed S. Kamel, Yang Wang
ICDM
2006
IEEE
Dimension Reduction for Supervised Ordering
Ordered lists of objects are widely used as representational forms. Such ordered objects include Web search results and best-seller lists. Techniques for processing such ordinal d...
Toshihiro Kamishima, Shotaro Akaho
ICDM
2006
IEEE
On the Lower Bound of Local Optimums in K-Means Algorithm
The k-means algorithm is a popular clustering method used in many different fields of computer science, such as data mining, machine learning and information retrieval. However, ...
Zhenjie Zhang, Bing Tian Dai, Anthony K. H. Tung
ICDM
2006
IEEE
Boosting Kernel Models for Regression
This paper proposes a general boosting framework for combining multiple kernel models in the context of both classification and regression problems. Our main approach is built on...
Ping Sun, Xin Yao
ICDM
2006
IEEE
Fast Random Walk with Restart and Its Applications
How closely related are two nodes in a graph? How to compute this score quickly, on huge, disk-resident, real graphs? Random walk with restart (RWR) provides a good relevance scor...
Hanghang Tong, Christos Faloutsos, Jia-Yu Pan