Sciweavers

KDD
1998
ACM

Methods for Linking and Mining Massive Heterogeneous Databases

13 years 8 months ago
Methods for Linking and Mining Massive Heterogeneous Databases
Manyreal-world KDDexpeditions involve investigation of relationships betweenvariables in different, heterogeneousdatabases. Wepresent a dynamic programmingtechnique for linking records in multiple heterogeneousdatabases usinglooselydefinedfields that allowfree-style verbatim entries. Wedevelop an interestingness measurebased on non-parametric randomization tests, whichcan be used for miningpotentially useful relationships amongvariables. This measure usesdistributional characteristics of historical events, hence accommodatingvariable-length records in a natural way.Asan illustration, we include a successful application of the proposed methodologyto a real-world data miningproblem at LucentTechnologies.
José C. Pinheiro, Don X. Sun
Added 06 Aug 2010
Updated 06 Aug 2010
Type Conference
Year 1998
Where KDD
Authors José C. Pinheiro, Don X. Sun
Comments (0)