Published February 27, 2024 | Version v1
Preprint Open

Variability in data transformation: towards data migration product lines

  • 1. ROR icon Universidad de Sevilla
  • 2. ROR icon Universidad de Málaga

Description

Software evolution often requires data management and more concretely data migration. Data migration follows an ETL process: extracting (E) data from a source, transforming (T) the data depending on migration needs, and loading (L) the data in a target data storage. Data migration projects are recognised to be complex and challenging to manage, which can lead to resource loss and planning delays. Among the reasons for data migration project failure is the lack of systematic artifact reuse (e.g., transformation script) in the data migration process. Every new data migration project is often developed from scratch. Software product line (SPL) engineering has been applied in many different domains to systematically reuse artifacts (e.g., code platforms, test cases) in software development processes and there are many positive experiences when applying SPL to reduce cost and time. In this paper, we present an approach using SPL techniques for data migration projects, concretely, in the data transformation stage. Our solution facilitates the automated creation of scripts that can be reused in different data migration projects. The feasibility of the proposal is validated in the domain of web information systems modernization. The validation shows how various migration scripts can be created to transform data between different content management systems. With this work, new opportunities are opened for studying the synergies of SPL and data migration. To the best of our knowledge, this is the first proposal that uses a complete stack of SPL that materializes the reuse of artifacts for data migration.

Notes

This work was supported by the TASOVA PLUS research network (RED2022-134337-T)
funded by FEDER/Ministry of Science and Innovation — State Research Agency. The
work from the University of Seville was supported by the project Data-pl funded by
FEDER/Ministry of Science and Innovation; the COPERNICA (P20_01224) and META-
MORFOSIS (FEDER_US-1381375) projects funded by Junta de Andalucía. The work from
the University of Málaga was supported by the projects IRIS PID2021-122812OB-I00
(co-financed by FEDER funds), LEIA UMA18-FEDERJA-157, and DAEMON H2020-101017109.

Files

VAMOS_24_US___Transformo (1).pdf

Files (1.2 MB)

Name Size Download all
md5:76e762e8cdb6aafe1b113f162ee72a17
1.2 MB Preview Download

Additional details

Funding

European Commission
DAEMON – Network intelligence for aDAptive and sElf-Learning MObile Networks 101017109