Sciweavers

ICDM
2008
IEEE
104views Data Mining» more  ICDM 2008»
13 years 11 months ago
A Generative Probabilistic Model for Multi-label Classification
Hongning Wang, Minlie Huang, Xiaoyan Zhu
ICDM
2008
IEEE
155views Data Mining» more  ICDM 2008»
13 years 11 months ago
Alert Detection in System Logs
We present Nodeinfo, an unsupervised algorithm for anomaly detection in system logs. We demonstrate Nodeinfo’s effectiveness on data from four of the world’s most powerful sup...
Adam J. Oliner, Alex Aiken, Jon Stearley
ICDM
2008
IEEE
80views Data Mining» more  ICDM 2008»
13 years 11 months ago
Collective Latent Dirichlet Allocation
In this paper, we propose a new variant of Latent Dirichlet Allocation(LDA): Collective LDA (C-LDA), for multiple corpora modeling. C-LDA combines multiple corpora during learning...
Zhiyong Shen, Jun Sun, Yi-Dong Shen
ICDM
2008
IEEE
183views Data Mining» more  ICDM 2008»
13 years 11 months ago
Collaborative Filtering for Implicit Feedback Datasets
A common task of recommender systems is to improve customer experience through personalized recommendations based on prior implicit feedback. These systems passively track differe...
Yifan Hu, Yehuda Koren, Chris Volinsky
ICDM
2008
IEEE
156views Data Mining» more  ICDM 2008»
13 years 11 months ago
Exploiting Local and Global Invariants for the Management of Large Scale Information Systems
This paper presents a data oriented approach to modeling the complex computing systems, in which an ensemble of correlation models are discovered to represent the system status. I...
Haifeng Chen, Haibin Cheng, Guofei Jiang, Kenji Yo...
ICDM
2008
IEEE
86views Data Mining» more  ICDM 2008»
13 years 11 months ago
Mining Large Networks with Subgraph Counting
Ilaria Bordino, Debora Donato, Aristides Gionis, S...
ICDM
2008
IEEE
160views Data Mining» more  ICDM 2008»
13 years 11 months ago
Direct Zero-Norm Optimization for Feature Selection
Zero-norm, defined as the number of non-zero elements in a vector, is an ideal quantity for feature selection. However, minimization of zero-norm is generally regarded as a combi...
Kaizhu Huang, Irwin King, Michael R. Lyu
ICDM
2008
IEEE
185views Data Mining» more  ICDM 2008»
13 years 11 months ago
Clustering Uncertain Data Using Voronoi Diagrams
We study the problem of clustering uncertain objects whose locations are described by probability density functions (pdf). We show that the UK-means algorithm, which generalises t...
Ben Kao, Sau Dan Lee, David W. Cheung, Wai-Shing H...
ICDM
2008
IEEE
130views Data Mining» more  ICDM 2008»
13 years 11 months ago
Text Cube: Computing IR Measures for Multidimensional Text Database Analysis
Since Jim Gray introduced the concept of ”data cube” in 1997, data cube, associated with online analytical processing (OLAP), has become a driving engine in data warehouse ind...
Cindy Xide Lin, Bolin Ding, Jiawei Han, Feida Zhu,...
ICDM
2008
IEEE
184views Data Mining» more  ICDM 2008»
13 years 11 months ago
Bayesian Co-clustering
In recent years, co-clustering has emerged as a powerful data mining tool that can analyze dyadic data connecting two entities. However, almost all existing co-clustering techniqu...
Hanhuai Shan, Arindam Banerjee