Published December 1, 2023 | Version v1
Conference proceeding Open

Facilitating future open data reuse via continuous integration of actionable data analysis examples

  • 1. ROR icon European Organization for Nuclear Research

Description

We describe a use case of facilitating open data reuse in experimental particle physics. The CERN Open Data portal disseminates over three petabytes of data published by LHC collaborations. The data is accompanied by preserved computational analysis environments and various analysis examples documenting and illustrating data access and reuse patterns. We have developed a "continuous integration" system allowing to run actionable code examples in their original computing environments using the REANA reproducible analysis workflows in order to ensure the validity of data access patterns across time. We conclude by advocating the importance of early preservation of actionable software examples alongside datasets as a key action to facilitate future data reuse.

Files

pv2023_continuous_reuse.pdf

Files (1.1 MB)

Name Size Download all
md5:43c5cc24a9923b45c70e33ce1c356b8a
1.1 MB Preview Download