Sciweavers

ICDM
2009
IEEE
131views Data Mining» more  ICDM 2009»
13 years 2 months ago
Unified Solution to Nonnegative Data Factorization Problems
In this paper, we restudy the non-convex data factorization problems (regularized or not, unsupervised or supervised), where the optimization is confined in the nonnegative orthan...
Xiaobai Liu, Shuicheng Yan, Jun Yan, Hai Jin
ICDM
2009
IEEE
150views Data Mining» more  ICDM 2009»
13 years 2 months ago
Filtering and Refinement: A Two-Stage Approach for Efficient and Effective Anomaly Detection
Anomaly detection is an important data mining task. Most existing methods treat anomalies as inconsistencies and spend the majority amount of time on modeling normal instances. A r...
Xiao Yu, Lu An Tang, Jiawei Han
ICDM
2009
IEEE
197views Data Mining» more  ICDM 2009»
13 years 2 months ago
A Linear-Time Graph Kernel
The design of a good kernel is fundamental for knowledge discovery from graph-structured data. Existing graph kernels exploit only limited information about the graph structures bu...
Shohei Hido, Hisashi Kashima
ICDM
2009
IEEE
134views Data Mining» more  ICDM 2009»
13 years 2 months ago
Efficient Discovery of Confounders in Large Data Sets
Given a large transaction database, association analysis is concerned with efficiently finding strongly related objects. Unlike traditional associate analysis, where relationships ...
Wenjun Zhou, Hui Xiong
ICDM
2009
IEEE
167views Data Mining» more  ICDM 2009»
13 years 2 months ago
Self-Adaptive Anytime Stream Clustering
Clustering streaming data requires algorithms which are capable of updating clustering results for the incoming data. As data is constantly arriving, time for processing is limited...
Philipp Kranen, Ira Assent, Corinna Baldauf, Thoma...
ICDM
2009
IEEE
153views Data Mining» more  ICDM 2009»
13 years 2 months ago
A New Clustering Algorithm Based on Regions of Influence with Self-Detection of the Best Number of Clusters
Clustering methods usually require to know the best number of clusters, or another parameter, e.g. a threshold, which is not ever easy to provide. This paper proposes a new graph-b...
Fabrice Muhlenbach, Stéphane Lallich
ICDM
2009
IEEE
154views Data Mining» more  ICDM 2009»
13 years 2 months ago
GSML: A Unified Framework for Sparse Metric Learning
There has been significant recent interest in sparse metric learning (SML) in which we simultaneously learn both a good distance metric and a low-dimensional representation. Unfor...
Kaizhu Huang, Yiming Ying, Colin Campbell
ICDM
2009
IEEE
107views Data Mining» more  ICDM 2009»
13 years 2 months ago
Naive Bayes Classification of Uncertain Data
Traditional machine learning algorithms assume that data are exact or precise. However, this assumption may not hold in some situations because of data uncertainty arising from mea...
Jiangtao Ren, Sau Dan Lee, Xianlu Chen, Ben Kao, R...
ICDM
2009
IEEE
145views Data Mining» more  ICDM 2009»
13 years 2 months ago
Significance of Episodes Based on Minimal Windows
Discovering episodes, frequent sets of events from a sequence has been an active field in pattern mining. Traditionally, a level-wise approach is used to discover all frequent epis...
Nikolaj Tatti
ICDM
2009
IEEE
156views Data Mining» more  ICDM 2009»
13 years 2 months ago
Scalable Classification in Large Scale Spatiotemporal Domains Applied to Voltage-Sensitive Dye Imaging
We present an approach for learning models that obtain accurate classification of large scale data objects, collected in spatiotemporal domains. The model generation is structured ...
Igor Vainer, Sarit Kraus, Gal A. Kaminka, Hamutal ...