Sciweavers

94 search results - page 11 / 19
» kdd 2003
Sort
View
KDD
2003
ACM
128views Data Mining» more  KDD 2003»
15 years 11 months ago
Similarity analysis on government regulations
Government regulations are semi-structured text documents that are often voluminous, heavily cross-referenced between provisions and even ambiguous. Multiple sources of regulation...
Gloria T. Lau, Kincho H. Law, Gio Wiederhold
KDD
2003
ACM
113views Data Mining» more  KDD 2003»
15 years 11 months ago
Mining unexpected rules by pushing user dynamics
Unexpected rules are interesting because they are either previously unknown or deviate from what prior user knowledge would suggest. In this paper, we study three important issues...
Ke Wang, Yuelong Jiang, Laks V. S. Lakshmanan
71
Voted
KDD
2003
ACM
148views Data Mining» more  KDD 2003»
15 years 11 months ago
A highly-usable projected clustering algorithm for gene expression profiles
Projected clustering has become a hot research topic due to its ability to cluster high-dimensional data. However, most existing projected clustering algorithms depend on some cri...
Kevin Y. Yip, David W. Cheung, Michael K. Ng
65
Voted
KDD
2003
ACM
109views Data Mining» more  KDD 2003»
15 years 11 months ago
Generative model-based clustering of directional data
High dimensional directional data is becoming increasingly important in contemporary applications such as analysis of text and gene-expression data. A natural model for multivaria...
Arindam Banerjee, Inderjit S. Dhillon, Joydeep Gho...
73
Voted
KDD
2003
ACM
156views Data Mining» more  KDD 2003»
15 years 11 months ago
Mining distance-based outliers in near linear time with randomization and a simple pruning rule
Defining outliers by their distance to neighboring examples is a popular approach to finding unusual examples in a data set. Recently, much work has been conducted with the goal o...
Stephen D. Bay, Mark Schwabacher