Published October 23, 2024 | Version v1
Project deliverable Open

BY-COVID D2.4 Report on data sources discovery and integration for enabling data use and re-use in response to future outbreaks

  • 1. ROR icon Barcelona Supercomputing Center
  • 1. KNAW-DANS
  • 2. TAU-FSD
  • 3. ROR icon European Clinical Research Infrastructure Network
  • 4. EKKE
  • 5. EUI
  • 6. ROR icon Lygature
  • 7. VUmc
  • 8. ROR icon Euro-BioImaging ERIC
  • 9. IACS
  • 10. ROR icon Sciensano (Belgium)
  • 11. ROR icon Fraunhofer Institute for Translational Medicine and Pharmacology
  • 12. ROR icon EU-OPENSCREEN
  • 13. EMBL-EBI
  • 14. ROR icon Biobanking and Biomolecular Resources Research Infrastructure Consortium
  • 15. ROR icon ELIXIR

Description

In the face of rapid technological advancements and the challenges posed by global health crises like the COVID-19 pandemic, the ability to efficiently discover and integrate diverse data sources is crucial. This report explores innovative mechanisms for data discovery and integration, focusing on strategies beyond discovery at source methods (which are part of deliverable 2.3), such as the Beacon. While the Beacon provides foundational insights into direct data discovery, building a more robust and flexible data infrastructure requires exploring alternative approaches that enhance data use and re-use across various platforms and disciplines. These strategies aim to facilitate seamless data integration, enabling more effective and timely responses to future health emergencies.

The report concentrates on the discovery and integration of three primary categories of data: non-patient data, human clinical and health data, and socioeconomic data. Each data type presents unique challenges and opportunities for comprehensive outbreak responses. The report incorporates lessons learned from BY-COVID use cases, providing practical insights into the complexities of data discovery and integration. By integrating these real-world examples, the report ensures that its recommendations are not only theoretically sound but also practically viable and effective in real-world scenarios.

The connection between WP2 and other WPs within the BY-COVID project is also highlighted in the report. WP2 worked closely with WP3 to maintain the FAIRsharing collection for BY-COVID, which includes diverse data types such as non-patient data, human clinical data, and socioeconomic data. The collaboration extended to enhancing the COVID-19 Data Portal, integrating various datasets, including host sequences, imaging, and social sciences. These efforts involved developing tools for data mobilisation, ensuring that datasets are easily discoverable and usable within the portal.

Additionally, WP2 contributed to the Infectious Diseases Toolkit (IDTk) by developing dedicated pages for data sources and analysis, aligning closely with WP4's objectives. Lessons learned from the baseline use case in WP5 emphasised the importance of seamless data integration and mobilisation of sensitive health data, ensuring compliance with legal and ethical standards. WP2 also collaborated with WP6 on training activities, including workshops on using the Beacon tool for sensitive data discovery and FAIR bioimage data management. These collaborations across work packages underscore the project's commitment to enhancing data accessibility, interoperability, and reusability, ultimately supporting global health preparedness and response efforts.

Files

BY-COVID_D_2.4 - Report on data sources discovery and integration for enabling data use and re-use in response to future outbreaks.pdf

Additional details

Funding

BY-COVID – Beyond COVID 101046203
European Commission

Dates

Submitted
2024-09-30