Sciweavers

KDD
2003
ACM
Weighted Association Rule Mining using weighted support and significance framework
We address the issues of discovering significant binary relationships in transaction datasets in a weighted setting. The traditional model of association rule mining is adapted to han...
Feng Tao, Fionn Murtagh, Mohsen Farid
KDD
2003
ACM
Assessment and pruning of hierarchical model based clustering
The goal of clustering is to identify distinct groups in a dataset. The basic idea of model-based clustering is to approximate the data density by a mixture model, typically a mix...
Jeremy Tantrum, Alejandro Murua, Werner Stuetzle
KDD
2003
ACM
Mining phenotypes and informative genes from gene expression data
Mining microarray gene expression data is an important research topic in bioinformatics with broad applications. While most of the previous studies focus on clustering either gene...
Chun Tang, Aidong Zhang, Jian Pei
KDD
2003
ACM
Discovery of climate indices using clustering
To analyze the effect of the oceans and atmosphere on land climate, Earth scientists have developed climate indices, which are time series that summarize the behavior of selected ...
Michael Steinbach, Pang-Ning Tan, Vipin Kumar, Ste...
KDD
2003
ACM
Generating English summaries of time series data using the Gricean maxims
We are developing technology for generating English textual summaries of time-series data, in three domains: weather forecasts, gas-turbine sensor readings, and hospital intensive...
Somayajulu Sripada, Ehud Reiter, Jim Hunter, Jin Y...
KDD
2003
ACM
Frequent-subsequence-based prediction of outer membrane proteins
A number of medically important disease-causing bacteria (collectively called Gram-negative bacteria) are noted for the extra "outer" membrane that surrounds their cell....
Rong She, Fei Chen 0002, Ke Wang, Martin Ester, Je...
KDD
2003
ACM
Improving spatial locality of programs via data mining
In most computer systems, page fault rate is currently minimized by generic page replacement algorithms which try to model the temporal locality inherent in programs. In this pape...
Karlton Sequeira, Mohammed Javeed Zaki, Boleslaw K...
KDD
2003
ACM
Cross-training: learning probabilistic mappings between topics
Classification is a well-established operation in text mining. Given a set of labels A and a set DA of training documents tagged with these labels, a classifier learns to assign l...
Sunita Sarawagi, Soumen Chakrabarti, Shantanu Godb...
KDD
2003
ACM
Clinical and financial outcomes analysis with existing hospital patient records
Existing patient records are a valuable resource for automated outcomes analysis and knowledge discovery. However, key clinical data in these records is typically recorded in unst...
R. Bharat Rao, Sathyakama Sandilya, Radu Stefan Ni...
KDD
2003
ACM
Critical event prediction for proactive management in large-scale computer clusters
Adam J. Oliner, Anand Sivasubramaniam, Irina Rish,...