Declarative Data Fusion - Syntax, Semantics, and Implementation

10 years 5 months ago
Declarative Data Fusion - Syntax, Semantics, and Implementation
In today’s integrating information systems data fusion, i.e., the merging of multiple tuples about the same real-world object into a single tuple, is left to ETL tools and other specialized software. While much attention has been paid to architecture, query languages, and query execution, the final step of actually fusing data from multiple sources into a consistent and homogeneous set is often ignored. This paper states the formal problem of data fusion in relational databases and discusses which parts of the problem can already be solved with standard Sql. To bridge the final gap, we propose the SQL Fuse By statement and define its syntax and semantics. A first implementation of the statement in a prototypical database system shows the usefulness and feasibility of the new operator. 1 Data Fusion Integrated (relational) information systems provide users with only one uniform view to different (relational) data sources. Querying the underlying different data sources, combining...
Jens Bleiholder, Felix Naumann
Added 26 Jun 2010
Updated 26 Jun 2010
Type Conference
Year 2005
Authors Jens Bleiholder, Felix Naumann
Comments (0)