Published February 18, 2026 | Version v1
Dataset Open

BIL genome catalogue

  • 1. ROR icon Wellcome Sanger Institute

Description

This repository contains 4,098 genomes from the Bifidobacterium infantis-longum (BIL) species complex analysed in the manuscript titled “Genomic atlas of Bifidobacterium infantis and B. longum informs infant probiotic design.”

These data are made available on an open access basis for research use only. Any person wishing to use these data for commercial purposes must first enter into an appropriate commercial licensing and benefit-sharing agreement with the relevant CHAIN participating country. For Malawi, the relevant authority is the Public Health Research Institute, Ministry of Health, Republic of Malawi.

Additional metadata associated with the CHAIN NCC study are archived on the Harvard Dataverse (https://doi.org/10.7910/DVN/X6FAGX). The data contain sensitive information about study participants and may include identifiers that could compromise confidentiality or lead to ethnic stigmatisation. Access to these data requires submission of a formal request for consideration by our Data Governance Committee. Email completed data request form to the Data Governance Committee at dgc@kemri-wellcome.org. The requester provides investigators details, variables requested, intended use of the dataset, potential risks of the study including risks to confidentiality of individuals or communities, potential benefits of the study including to participant communities, scientific capacity building or health policy and planned outputs (if analysis on dataset will result in publication or reports or presentations). The requester also needs to formally agree to the conditions and limitations for data sharing to avoid misuse of shared data. Processing of data requests takes between 4 weeks to 6 weeks from submission.

Files

Files (3.1 GB)

Name Size Download all
md5:77d34a3f68ec61cb3b22f1f1a1dc3438
3.1 GB Download

Additional details

Dates

Available
2026-02-18

References