Sciweavers

72 search results - page 6 / 15
» sdm 2007
Sort
View
SDM
2007
SIAM
137views Data Mining» more  SDM 2007»
14 years 11 months ago
Semi-supervised Feature Selection via Spectral Analysis
Feature selection is an important task in effective data mining. A new challenge to feature selection is the so-called “small labeled-sample problem” in which labeled data is...
Zheng Zhao, Huan Liu
SDM
2007
SIAM
120views Data Mining» more  SDM 2007»
14 years 11 months ago
An Analysis of Logistic Models: Exponential Family Connections and Online Performance
Logistic models are arguably one of the most widely used data analysis techniques. In this paper, we present analyses focussing on two important aspects of logistic models—its r...
Arindam Banerjee
SDM
2007
SIAM
130views Data Mining» more  SDM 2007»
14 years 11 months ago
Maximizing the Area under the ROC Curve with Decision Lists and Rule Sets
Decision lists (or ordered rule sets) have two attractive properties compared to unordered rule sets: they require a simpler classification procedure and they allow for a more co...
Henrik Boström
SDM
2007
SIAM
171views Data Mining» more  SDM 2007»
14 years 11 months ago
A Better Alternative to Piecewise Linear Time Series Segmentation
Time series are difficult to monitor, summarize and predict. Segmentation organizes time series into few intervals having uniform characteristics (flatness, linearity, modality,...
Daniel Lemire
SDM
2007
SIAM
118views Data Mining» more  SDM 2007»
14 years 11 months ago
On Privacy-Preservation of Text and Sparse Binary Data with Sketches
In recent years, privacy preserving data mining has become very important because of the proliferation of large amounts of data on the internet. Many data sets are inherently high...
Charu C. Aggarwal, Philip S. Yu