Published October 11, 2024 | Version v1
Project deliverable Open

GDI D6.3 - Report on requirements for data quality and distributed analysis, as well as external resource interoperability

  • 1. Health-RI
  • 2. CRG
  • 3. VIB

Description

This deliverable reports progress towards understanding the data needs of the 1+MG infrastructure that arise because we are not building a data lake: not all data that needs to be analysed will be in the infrastructure, and the data that is in the infrastructure will not all be in the same place. The architecture of the infrastructure will make this as transparent as possible to the users of the data, but it cannot make the distributed nature completely invisible.

For data that is held in different places, agreements must be made about which data is included and how it is presented.

To co-analyse data that is external to the 1+MG infrastructure, agreements must be in place about the interfaces and practical setup used to link the internal data to the external data. This linkage will take place within each country and is therefore subject to the ethical and legal landscape of each country separately.

Files

202410 - GDI_D6.3 Report on requirements for data quality and distributed analysis, as well as external resource interoperability.pdf

Additional details

Funding

European Commission
European Genomic Data Infrastructure (GDI) 101081813

Dates

Submitted
2024-10-14