A Machine Learning Approach to Foreign Key Discovery

13 years 11 months ago

Download webdb09.cse.buffalo.edu

We study the problem of automatically discovering semantic associations between schema elements, namely foreign keys. This problem is important in all applications where data sets need to be integrated that are structured in tables but without explicit foreign key constraints. If such constraints could be recovered automatically, querying and integrating such databases would become much easier. Clearly, one may find candidates for foreign key constraints in a given database instance by computing all inclusion dependencies (IND) between attributes. However, this set usually contains many false positives due to spurious set inclusions. We present a machine learning approach to tackle this problem. We first compute all INDs of a given schema and let each be judged by a binary classification algorithm using a small set of features that can be derived efficiently using standard SQL. We demonstrate the feasibility of this approach using crossvalidation with several state-of-the-art classifi...

Alexandra Rostin, Oliver Albrecht, Jana Bauckmann,

Real-time Traffic

Data Sets | Foreign Key | Foreign Key Constraints | Internet Technology | WEBDB 2009 |

claim paper

Related Content

» On MultiColumn Foreign Key Discovery

» Applying Data Mining and Machine Learning Techniques to Submarine Intelligence Analysis

» Discretization of Target Attributes for Subgroup Discovery

» An Intelligent Agent That Autonomously Learns How to Translate

» Layered critical values a powerful directadjustment approach to discovering significant pa...

» Genetic Programmingbased Construction of Features for Machine Learning and Knowledge Disco...

» Ubiquitous Knowledge Discovery

» DICE A Discovery Environment Integrating Inductive Bias

» Discovery of Concurrent Data Models from Experimental Tables A Rough Set Approach

Post Info
More Details (n/a)

Added	25 May 2010
Updated	25 May 2010
Type	Conference
Year	2009
Where	WEBDB
Authors	Alexandra Rostin, Oliver Albrecht, Jana Bauckmann, Felix Naumann, Ulf Leser

Comments (0)

Sciweavers

A Machine Learning Approach to Foreign Key Discovery

Data Sets | Foreign Key | Foreign Key Constraints | Internet Technology | WEBDB 2009 |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers