Sciweavers

ICDM
2005
IEEE
116views Data Mining» more  ICDM 2005»
13 years 10 months ago
Learning Functional Dependency Networks Based on Genetic Programming
Bayesian Network (BN) is a powerful network model, which represents a set of variables in the domain and provides the probabilistic relationships among them. But BN can handle dis...
Wing-Ho Shum, Kwong-Sak Leung, Man Leung Wong
ICDM
2005
IEEE
137views Data Mining» more  ICDM 2005»
13 years 10 months ago
Leveraging Relational Autocorrelation with Latent Group Models
The presence of autocorrelation provides a strong motivation for using relational learning and inference techniques. Autocorrelation is a statistical dependence between the values...
Jennifer Neville, David Jensen
ICDM
2005
IEEE
118views Data Mining» more  ICDM 2005»
13 years 10 months ago
A Heterogeneous Field Matching Method for Record Linkage
Record linkage is the process of determining that two records refer to the same entity. A key subprocess is evaluating how well the individual fields, or attributes, of the recor...
Steven Minton, Claude Nanjo, Craig A. Knoblock, Ma...
ICDM
2005
IEEE
128views Data Mining» more  ICDM 2005»
13 years 10 months ago
An Expected Utility Approach to Active Feature-Value Acquisition
In many classification tasks training data have missing feature values that can be acquired at a cost. For building accurate predictive models, acquiring all missing values is of...
Prem Melville, Foster J. Provost, Raymond J. Moone...
ICDM
2005
IEEE
148views Data Mining» more  ICDM 2005»
13 years 10 months ago
A Graph-Ranking Algorithm for Geo-Referencing Documents
This paper presents an application of PageRank for assigning documents with a corresponding geographical scope. We describe the technique in detail, together with its theoretical ...
Bruno Martins, Mário J. Silva
ICDM
2005
IEEE
188views Data Mining» more  ICDM 2005»
13 years 10 months ago
CLUMP: A Scalable and Robust Framework for Structure Discovery
We introduce a robust and efficient framework called CLUMP (CLustering Using Multiple Prototypes) for unsupervised discovery of structure in data. CLUMP relies on finding multip...
Kunal Punera, Joydeep Ghosh
ICDM
2005
IEEE
125views Data Mining» more  ICDM 2005»
13 years 10 months ago
Alternate Representation of Distance Matrices for Characterization of Protein Structure
The most suitable method for the automated classification of protein structures remains an open problem in computational biology. In order to classify a protein structure with an...
Keith Marsolo, Srinivasan Parthasarathy
ICDM
2005
IEEE
115views Data Mining» more  ICDM 2005»
13 years 10 months ago
Spatial Clustering of Chimpanzee Locations for Neighborhood Identification
Sandeep Mane, Carson Murray, Shashi Shekhar, Jaide...
ICDM
2005
IEEE
135views Data Mining» more  ICDM 2005»
13 years 10 months ago
Bit Reduction Support Vector Machine
Abstract— Support vector machines are very accurate classifiers and have been widely used in many applications. However, the training and to a lesser extent prediction time of s...
Tong Luo, Lawrence O. Hall, Dmitry B. Goldgof, And...
ICDM
2005
IEEE
187views Data Mining» more  ICDM 2005»
13 years 10 months ago
Parallel Algorithms for Distance-Based and Density-Based Outliers
An outlier is an observation that deviates so much from other observations as to arouse suspicion that it was generated by a different mechanism. Outlier detection has many applic...
Elio Lozano, Edgar Acuña