There is a newer version of the record available.

Published September 27, 2023 | Version v1
Dataset Open

Data Set for Enhanced Performance Prediction of ATL Model Transformations

  • 1. Institute of Software Engineering and Programming Languages, Ulm University
  • 2. Institute of Medical Systems Biology, Ulm University
  • 3. Institute of Neural Information Processing, Institute of Medical Systems Biology, Ulm University
  • 4. Institute of Neural Information Processing, Ulm University

Description

Model transformation languages are domain-specific languages, which are designed to comfortably define transformations. With the increasing use of transformations in various domains, the complexity and size of input models are also increasing. However, developers often lack suitable models for performance testing. We have therefore conducted experiments in which we predict the performance of model transformations based on characteristics of input models using machine learning approaches. In particular, we focused on how to predict the performance of transformations that also transform attributes whose values can have arbitrary size. This dataset contains our raw and processed input data, the scripts necessary to repeat our experiments, and the results we obtained.

Our input data consists of the time measurements for six different transformations defined in the Atlas Transformation Language (ATL), as well as the collected characteristics of the real-world input models we used. In this data set, we provide the script that implements our experiments. We predict the execution time of ATL transformations using the machine learning approaches linear regression, random forests and support vector regression using a radial basis function kernel. We also investigate different sets of characteristics of input models as input for the machine learning approaches. These are described in detail in the provided documentation.pdf. The results of the experiments are provided as raw data in individual cvs files. Furthermore, we provide our Eclipse plugin, which collects the characteristics for a set of given models.

A detailed documentation is available in documentaion.pdf.

Notes

This work was partially funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) - 358569332. The authors acknowledge support by the state of Baden-Württemberg through bwHPC.

Files

DataSetForEnhancedPerformancePredictionOfATLModelTransformations.zip

Files (36.8 MB)

Name Size Download all
md5:d840b59416f09ac237a51734394fc75a
36.3 MB Preview Download
md5:31585cd9ba88a021013bfb68e9c94115
533.7 kB Preview Download