Sciweavers

ICDM
2010
IEEE
Exponential Family Tensor Factorization for Missing-Values Prediction and Anomaly Detection
In this paper, we study probabilistic modeling of heterogeneously attributed multi-dimensional arrays. The model can manage the heterogeneity by employing an individual exponential...
Kohei Hayashi, Takashi Takenouchi, Tomohiro Shibat...
Learning a Bi-Stochastic Data Similarity Matrix
An idealized clustering algorithm seeks to learn a cluster-adjacency matrix such that, if two data points belong to the same cluster, the corresponding entry would be 1; otherwise ...
Fei Wang, Ping Li, Arnd Christian König
Data Editing Techniques to Allow the Application of Distance-Based Outlier Detection to Streams
The problem of finding outliers in data has broad applications in areas as diverse as data cleaning, fraud detection, network monitoring, invasive species monitoring, etc. While th...
Vit Niennattrakul, Eamonn J. Keogh, Chotirat Ann R...
Causal Discovery from Streaming Features
In this paper, we study a new research problem of causal discovery from streaming features. A unique characteristic of streaming features is that not all features can be available ...
Kui Yu, Xindong Wu, Hao Wang, Wei Ding
Hierarchical Ensemble Clustering
Ensemble clustering has emerged as an important elaboration of the classical clustering problems. Ensemble clustering refers to the situation in which a number of different (input)...
Li Zheng, Tao Li, Chris H. Q. Ding
Consequences of Variability in Classifier Performance Estimates
The prevailing approach to evaluating classifiers in the machine learning community involves comparing the performance of several algorithms over a series of usually unrelated data...
Troy Raeder, T. Ryan Hoens, Nitesh V. Chawla
Recommending Social Events from Mobile Phone Location Data
A city offers thousands of social events a day, and it is difficult for dwellers to make choices. The combination of mobile phones and recommender systems can change the way one de...
Daniele Quercia, Neal Lathia, Francesco Calabrese,...
Adaptive Distances on Sets of Vectors
Adam Woznica, Alexandros Kalousis
Probabilistic Inference Protection on Anonymized Data
Background knowledge is an important factor in privacy preserving data publishing. Probabilistic distribution-based background knowledge is a powerful kind of background knowledge w...
Raymond Chi-Wing Wong, Ada Wai-Chee Fu, Ke Wang, Y...
Modeling Experts and Novices in Citizen Science Data for Species Distribution Modeling
Citizen scientists, who are volunteers from the community that participate as field assistants in scientific studies [3], enable research to be performed at much larger spatial and...
Jun Yu, Weng-Keen Wong, Rebecca A. Hutchinson