Published June 5, 2026 | Version v2

Portable Microhaplotype Object - Example Datasets

  • 1. ROR icon University of California, San Francisco

Description

All example datasets for 'The Portable Microhaplotype Object and Tools' manuscript are available. The archive contains five folders, each corresponding to one dataset:

  • Dataset1: Public genomic surveillance data of Plasmodium falciparum from four countries: Eswatini, Namibia, South Africa, and Zambia.

  • ANOSPP: Combined Anopheles and Plasmodium data.

  • mips_v_mad4hatter: Data from the MAD4HatTeR amplicon sequencing assay and the DR23K molecular inversion probe (MIP) assay comparison.

  • E_coli: Escherichia coli datasets sourced from the Sequence Read Archive (SRA).

  • S_aureus: Staphylococcus aureus datasets sourced from SRA.

For Dataset1 and ANOSPP, the archive includes all raw data files as well as Jupyter notebooks used to generate the PMO. Dataset1 additionally includes the notebook used to produce Figure 3. PMOs for all five datasets are included.

Further details about the file structure and contents are provided in the README.txt file with the data.

 

Files

example_datasets.zip

Files (21.5 MB)

Name Size Download all
md5:400f142ed8b747cb69c6e615039835e8
21.5 MB Preview Download

Additional details

Funding

National Institute of Allergy and Infectious Diseases
Data and analysis ecosystem for eukaryotic pathogen targeted sequencing 4U01AI184646-02

Software

Programming language
Python