Published December 11, 2023 | Version v2
Dataset Open

The Helicobacter pylori Genome Project (HpGP) Phase1 dataset and 255 H. pylori population reference dataset

Description

This repository holds the HpGP Phase 1 genomic dataset for Hp26695 and 1011 study samples. All 1012 genomic sequences were annotated using the NCBI Prokaryotic Genome Annotation Pipeline(PGAP). Also, it has 255 curated public available H. pylori genomic sequences used for population structure analysis in Thorell et al. Nature Communications, 14:8184 (2023).

You can check the NCBI BioProject website for the latest annotation and sequence updates.

https://www.ncbi.nlm.nih.gov/bioproject/?term=HpGP

Please cite the above-mentioned paper if you use the data.

Files

Files (3.3 GB)

Name Size Download all
md5:8273cedb31a5eb2340c5a82729e7f12a
759.8 MB Download
md5:ffa0c622a809d446820622437fd14778
2.6 GB Download
md5:d098c20319765e399b49f37a7fe0c0c1
24.3 kB Download
md5:0514239bdbaba7e55a77dc37993bba0b
95.0 kB Download

Additional details

Dates

Available
2023-12-20
HpGP published dataset

References

  • Nature Communications, 14:8184 (2023)