Published December 13, 2022 | Version v1
Dataset Open

Build and measurements of Linux kernel configurations across different versions

  • 1. INSA Rennes, IUF, IRISA/Inria
  • 2. PUC-Rio
  • 3. Université de Rennes 1
  • 4. CNRS

Description

With large scale and complex configurable systems, it is hard for users to choose the right combination of options (i.e., configurations) in order to obtain the wanted trade-off between functionality and performance goals such as speed or size. Machine learning can help in relating these goals to the configurable system options, and thus, predict the effect of options on the outcome, typically after a costly training step. However, many configurable systems evolve at such a rapid pace that it is impractical to retrain a new model from scratch for each new version. Taking the extreme case of the Linux kernel with its ≈ 14, 500 configuration options, we investigate how binary size predictions of kernel size degrade over successive versions (and how transfer learning can be adapted and applied to mitigate this degradation). 

We used and are sharing a unique and large dataset constituted of the binary sizes (compressed and non-compressed) of thousands of configurations for different versions of the kernel, spanning three years (4.13, 4.15, 4.20, 5.0, 5.4, 5.7, and 5.8). Overall, around 200K configurations over 10K+ options/features and 6 versions. 

This dataset has been used in the Transactions of Software Engineering (TSE) article "Transfer Learning Across Variants and Versions: The Case of Linux Kernel Size" (preprint: https://hal.inria.fr/hal-03358817) 

Notes

works at least on pandas version : 1.4.3 (for the pickle)

Files

Files (2.5 GB)

Name Size Download all
md5:0f5703cbe160865a0facc93a9b3784e4
892.0 MB Download
md5:acb5b18900651dd179b41094661c6c54
378.0 MB Download
md5:934dbc3704cba6c02a5aa48873180fc2
243.5 MB Download
md5:8aca469ee01700c68ecb7a5937a2ba95
208.9 MB Download
md5:d6e791f0648c15766915c8a9807da80c
284.0 MB Download
md5:e5d6074f57ae31cf3d87d7746b332d8e
232.2 MB Download
md5:abe24e9af1a7dfc02f03f29bd30b3a13
256.7 MB Download

Additional details

Related works

Has part
Journal article: https://hal.inria.fr/hal-03358817 (URL)