Published July 5, 2022 | Version v1
Presentation Open

EMERGEN-BioInfo: the digital platform for the French SARS-CoV-2 genomic surveillance and research program

  • 1. CNRS, Institut Français de Bioinformatique, IFB-core, UMS 3601, Évry, France
  • 2. CNRS UMR7104, Inserm U1258, Université de Strasbourg, IGBMC, Illkirch, France
  • 3. Sorbonne Université, CNRS FR3631, Inserm US037, IBPS, ARTbio Bioinformatics Analysis Facility, Paris, France
  • 4. Santé Publique France, 12, rue du Val d'Osne 94 415 Saint-Maurice Cedex
  • 5. Inserm, 101 rue de Tolbiac 75013 Paris
  • 6. Université Paris-Saclay, INRAE, MaIAGE, 78350 Jouy-en-Josas, France
  • 7. Sorbonne Université, CNRS FR3631, Inserm US037, IBPS, ARTbio Bioinformatics Analysis Facility, Paris, France
  • 8. Sorbonne Université, CNRS, FR2424, ABiMS, Station Biologique, 29680, Roscoff, France

Description

We present EMERGEN-Bioinfo, the digital platform to collect, process, manage and disseminate viral sequences and non-sensitive metadata, developed in the context of EMERGEN, the French plan for COVID-19 genomic surveillance and research. The bioinformatics platform relies on several components to manage every step from raw sequence collection to deposition in international repositories. These include: (1) a dedicated storage space for each of the 60 teams of the consortium; (2) a data lake gathering all sequences (raw, mapped, consensus genomes, aligned genomic and peptide sequences); (3) system-level workflows to handle the data flow through all the components of the platform; (4) a COVID-19-specific domain of the national Galaxy server (covid19.usegalaxy.fr); (5) EMERGEN-DB, a database to store and manage non-sensitive metadata and genomic consensus sequences; (6) data brokering services to facilitate metadata management and curation, submission to international repositories (GISAID and ENA) and follow-up of their acceptance status. The EMERGEN-Bioinfo platform is complemented by a high-security digital platform (EMERGEN-HDS) certified for Health Data Storage, which will enable researchers to pair EMERGEN data with patient data from different sources (national COVID-19 and healthcare databases). All the software resources developed for this project will be accessible under an open license and reusable for other national projects (e.g. the ABRomics multi-omics platform for surveillance and research on antimicrobial resistance) or for international cooperation (e.g. sharing with partners of the European bioinformatics infrastructure ELIXIR).

Files

2022-07-05_JOBIM.pptx.pdf

Files (16.7 MB)

md5:9443206ba77ce78efb3358c6949877fb

Additional details

Related works

Has part
Poster: 10.5281/zenodo.6777531 (DOI)