A taxonomy of ETL activities

Extract-Transform-Load (ETL) activities are software modules responsible for populating a data warehouse with operational data that have undergone a series of transformations on their way to the warehouse. The whole process is very complex and of significant importance for the design and maintenance of the data warehouse. A plethora of commercial ETL tools are already available in the market. However, each of them follows a different approach to modeling ETL activities, i.e., the building blocks of an ETL workflow. As a result, there is so far no standard or unified approach for describing such activities. In this paper, we work towards the identification of generic properties that characterize ETL activities. In doing so, we follow a black-box approach and provide a taxonomy that characterizes ETL activities in terms of the relationship of their input to their output, as well as a normal form that is based on interpreted semantics for the black-box activities....
Panos Vassiliadis, Alkis Simitsis, Eftychia Baikousi
Added 28 May 2010
Updated 28 May 2010
Type Conference
Year 2009
Authors Panos Vassiliadis, Alkis Simitsis, Eftychia Baikousi