EMERGEN-BioInfo The digital platform for the French SARS-CoV-2 genomic surveillance and research program
Authors/Creators
-
Denecker, Thomas1
-
Messak, Imane1
-
Mohamed, Anliat1
-
Antoinat, Chiara1
- Le Bars, Arthur1
- Tonazzolli, Arianna1
- Demaille, Benjamin1
- Sand, Olivier1
- Gerbes, François1
- Rosnet, Thomas1
- Bouri, Laurent1
- Seiler, Julien2
- Charrière, Nicole1
- Antoniewski, Christophe3
- Bozorgan, Anne4
- Castro Alvarez, Javier4
- Sudour, Jeanne4
- Le Strat, Yann4
- Coignard, Bruno4
- Amzert, Abdelkader5
- Gharbi, Nebras5
- Lethimonier, Franck5
-
Chiapello, Hélène6
-
Naouar, Naira7
- Médigue, Claudine1
- Le Corguillé, Gildas8
- Salgado, David1
-
van Helden, Jacques1
- 1. CNRS, Institut Français de Bioinformatique, IFB-core, UMS 3601, Évry, France
- 2. CNRS UMR7104, Inserm U1258, Université de Strasbourg, IGBMC, Illkirch, France
- 3. Sorbonne Université, CNRS FR3631, Inserm US037, IBPS, ARTbio Bioinformatics Analysis Facility, Paris, France
- 4. Santé Publique France, 12, rue du Val d'Osne 94 415 Saint-Maurice Cedex
- 5. Inserm, 101 rue de Tolbiac 75013 Paris
- 6. Université Paris-Saclay, INRAE, MaIAGE, 78350 Jouy-en-Josas, France
- 7. Sorbonne Université, CNRS FR3631, Inserm US037, (IBPS), ARTbio Bioinformatics Analysis Facility, Paris, France
- 8. Sorbonne Université, CNRS, FR2424, ABiMS, Station Biologique, 29680, Roscoff, France
Description
We present EMERGEN-Bioinfo, the digital platform to collect, process, manage and divulgate viral sequences and non-sensitive metadata, developed in the context of EMERGEN, the French plan for COVID-19 genomic surveillance and research. The bioinformatics platform relies on different components to manage all the steps from raw sequence collection to deposition in international repositories. This includes: (1) specific storage spaces for each one of the 60 teams of the consortium; (2) a data lake gathering all sequences (raw, mapped, consensus genomes, aligned genomic and peptidic sequences); (3) system-level workflows to handle the data flow trough all the components of the platform; (4) a covid-19 specific domain of the national Galaxy server (covid19.usegalaxy.fr); (5) EMERGEN-DB, a database to store and manage non-sensitive metadata and genomic consensus sequences ; (6) data brokering services to facilitate metadata management and curation, submission to international repositories (GISAID and ENA) and follow-up of their acceptance status. The EMERGEN-Bioinfo platform is complemented by a high-security digital platform (EMERGEN-HDS) certified for Health Data Storage, which will enable researchers to pair EMERGEN data with patient data from different sources (national COVID-19 and healthcare databases). All the software resources developed for this projects will be accessible under an open license, and re-usable for other national projects (e.g. ABRomics multi-omics platform for surveillance and research on antimicrobial resistance) or international cooperation (e.g. sharing with partners of the European bioinformatics infrastructure ELIXIR).
Files
2022-07-05_JOBIM.pptx.pdf
Files
(16.7 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:9443206ba77ce78efb3358c6949877fb
|
16.7 MB | Preview Download |
Additional details
Related works
- Has part
- Poster: 10.5281/zenodo.6777531 (DOI)