Sciweavers

KDD
2006
ACM
130views Data Mining» more  KDD 2006»
14 years 4 months ago
Discovering significant rules
In many applications, association rules will only be interesting if they represent non-trivial correlations between all constituent items. Numerous techniques have been developed ...
Geoffrey I. Webb
KDD
2006
ACM
129views Data Mining» more  KDD 2006»
14 years 4 months ago
Suppressing model overfitting in mining concept-drifting data streams
Mining data streams of changing class distributions is important for real-time business decision support. The stream classifier must evolve to reflect the current class distributi...
Haixun Wang, Jian Yin, Jian Pei, Philip S. Yu, Jef...
KDD
2006
ACM
147views Data Mining» more  KDD 2006»
14 years 4 months ago
Summarizing itemset patterns using probabilistic models
In this paper, we propose a novel probabilistic approach to summarize frequent itemset patterns. Such techniques are useful for summarization, post-processing, and end-user interp...
Chao Wang, Srinivasan Parthasarathy
KDD
2006
ACM
177views Data Mining» more  KDD 2006»
14 years 4 months ago
Topics over time: a non-Markov continuous-time model of topical trends
This paper presents an LDA-style topic model that captures not only the low-dimensional structure of data, but also how the structure changes over time. Unlike other recent work t...
Xuerui Wang, Andrew McCallum
KDD
2006
ACM
166views Data Mining» more  KDD 2006»
14 years 4 months ago
Anonymizing sequential releases
An organization makes a new release as new information become available, releases a tailored view for each data request, releases sensitive information and identifying information...
Ke Wang, Benjamin C. M. Fung
KDD
2006
ACM
155views Data Mining» more  KDD 2006»
14 years 4 months ago
Camouflaged fraud detection in domains with complex relationships
We describe a data mining system to detect frauds that are camouflaged to look like normal activities in domains with high number of known relationships. Examples include accounti...
Sankar Virdhagriswaran, Gordon Dakin
KDD
2006
ACM
161views Data Mining» more  KDD 2006»
14 years 4 months ago
Efficient kernel feature extraction for massive data sets
Ivor W. Tsang, András Kocsor, James T. Kwok
KDD
2006
ACM
142views Data Mining» more  KDD 2006»
14 years 4 months ago
Mining distance-based outliers from large databases in any metric space
Let R be a set of objects. An object o R is an outlier, if there exist less than k objects in R whose distances to o are at most r. The values of k, r, and the distance metric ar...
Yufei Tao, Xiaokui Xiao, Shuigeng Zhou
KDD
2006
ACM
143views Data Mining» more  KDD 2006»
14 years 4 months ago
Mining long-term search history to improve search accuracy
Long-term search history contains rich information about a user's search preferences. In this paper, we study statistical language modeling based methods to mine contextual i...
Bin Tan, Xuehua Shen, ChengXiang Zhai