Published April 2, 2024 | Version v1
Other Open

Interoperability of Provenance Information in Clinical Research: a Case Study Combining HL7 FHIR and the Common Provenance Model - Supplementary materials

  • 1. Center for Advanced Studies Research and Development in Sardinia
  • 2. Masaryk University
  • 3. Biobanking and Biomolecular Resources Research Infrastructure Consortium
  • 1. Masaryk University
  • 2. Biobanking and Biomolecular Resources Research Infrastructure Consortium
  • 3. Medical University of Graz
  • 4. University of Würzburg

Description

This page contains the supplementary material for the paper "Interoperability of Provenance Information in Clinical Research: a Case Study Combining HL7 FHIR and the Common Provenance Model", submitted to the 34th Medical Informatics Europe Conference (MIE2024).

This case study presents a simulated example of how the Common Provenance Model (CPM) can be applied to a simple biomedical research use case in combination with domain-specific provenance tracking methods, such as the HL7 FHIR Provenance resource. It shows how to express the provenance of a distributed process in which a large dataset of colorectal cancer cohort cases (CRC-Cohort) is converted to HL7 FHIR resources and then queried. The CRC-Cohort dataset is a collection of harmonised data on over 10,000 cases of colorectal cancer samples, collected by several European biobanks under the coordination of the national nodes of the Biobanking and Biomolecular Resources Research Infrastructure – European Research Infrastructure Consortium (BBMRI-ERIC) within the European project ADOPT.
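As a rough illustration of the kind of statements such a provenance record contains, the sketch below writes a very small PROV-N document for a two-step process (conversion, then query) using only the Python standard library. All identifiers (the `ex` prefix, `crc_cohort_dataset`, `etl_conversion`, and so on) are hypothetical and are not taken from the notebook, and the CPM-specific structures are not modelled here.

```python
def prov_n_document(prefixes, statements):
    """Wrap PROV-N statements in a document block with prefix declarations."""
    lines = ["document"]
    for name, iri in prefixes.items():
        lines.append(f"  prefix {name} <{iri}>")
    lines.extend(f"  {statement}" for statement in statements)
    lines.append("endDocument")
    return "\n".join(lines)


# Hypothetical identifiers, chosen only for illustration.
statements = [
    "entity(ex:crc_cohort_dataset)",
    "entity(ex:fhir_resources)",
    "entity(ex:query_result)",
    "activity(ex:etl_conversion)",
    "activity(ex:query_execution)",
    # The conversion reads the source dataset and produces FHIR resources.
    "used(ex:etl_conversion, ex:crc_cohort_dataset, -)",
    "wasGeneratedBy(ex:fhir_resources, ex:etl_conversion, -)",
    # The query reads the FHIR resources and produces a result.
    "used(ex:query_execution, ex:fhir_resources, -)",
    "wasGeneratedBy(ex:query_result, ex:query_execution, -)",
    "wasDerivedFrom(ex:query_result, ex:fhir_resources)",
]

prov_n = prov_n_document({"ex": "http://example.org/"}, statements)
print(prov_n)
```

The `-` marker stands for an omitted time, which PROV-N allows in `used` and `wasGeneratedBy` statements.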

List of available resources:

File name and description:

CRC_Cohort_ETL_and_query_provenance.ipynb
    Python notebook used to generate and serialise the provenance of the use case.

prov_graphs.zip
    A folder containing the outputs of the notebook, which are:

      • documents containing the serialisation of the provenance information according to the PROV-N syntax;
      • images depicting the diagrams of the provenance information following the W3C PROV graph convention.

HL7 FHIR resources.zip
    A folder containing a small set of synthetic HL7 FHIR resources generated for this case study, in particular:

      • resources of different types (Patient, Condition, Specimen, Device, DocumentReference, Location, Organization) containing data equivalent to those obtained from the conversion of the source dataset;
      • Provenance resources documenting the generation of the FHIR resources containing the data.
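For readers unfamiliar with the FHIR Provenance resource, the following minimal sketch builds one as a plain Python dictionary and prints it as JSON. It assumes the FHIR R4 structure (`target`, `recorded`, `agent.who`, `entity.role`, `entity.what`); the resource ids and the choice of a Device as agent and a DocumentReference as source are hypothetical and do not reproduce the contents of the archive.

```python
import json
from datetime import datetime, timezone


def make_provenance(target_refs, agent_ref, source_ref, recorded=None):
    """Build a minimal FHIR Provenance resource as a dictionary."""
    if recorded is None:
        # FHIR 'instant' values need a time zone; use the current UTC time.
        recorded = datetime.now(timezone.utc).isoformat(timespec="seconds")
    return {
        "resourceType": "Provenance",
        # Resources whose generation is being documented.
        "target": [{"reference": ref} for ref in target_refs],
        "recorded": recorded,
        # Who or what performed the activity.
        "agent": [{"who": {"reference": agent_ref}}],
        # Input the targets were produced from.
        "entity": [{"role": "source", "what": {"reference": source_ref}}],
    }


# Hypothetical references, chosen only for illustration.
provenance = make_provenance(
    target_refs=["Patient/example-patient", "Specimen/example-specimen"],
    agent_ref="Device/example-etl-tool",
    source_ref="DocumentReference/example-source-dataset",
    recorded="2024-01-01T00:00:00+00:00",
)
print(json.dumps(provenance, indent=2))
```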

Files

Files (853.8 kB)

Checksum                               Size
md5:6c5f2492a3344570aed9945d141b9737   501.2 kB
md5:cae1b1a0f48d735d877956858ad58f87   12.7 kB
md5:9af957a826a782b3ed9c42b44170002e   339.9 kB
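The MD5 checksums listed above can be used to check that a downloaded file is intact. The sketch below is one way to do this with the Python standard library; the function names are illustrative, and the local file path in the usage comment is a placeholder, since the listing does not state which checksum belongs to which file.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file against a checksum given as 'md5:<hex>' or '<hex>'."""
    expected_hex = expected.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected_hex


# Usage (placeholder path; pair each download with its own listed checksum):
# matches_checksum("path/to/downloaded_file", "md5:<checksum from the listing>")
```

MD5 is adequate for detecting accidental corruption during download, but it should not be relied on as protection against deliberate tampering.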

Additional details

Funding

European Commission
Beyond COVID (grant no. 101046203)
European Commission
Accelerating datafication for support of EU health priorities, greening of biobanks and integrated approach to “One Health” (grant no. 101131701)

Dates

Submitted
2024-04-03
Date of submission of the paper to the MIE2024 conference editors for review.