Published February 27, 2026 | Version 1.0
Dissertation Restricted

Phylogenetic Analysis and Species Delineation of Corynebacterium Isolates using Whole-Genome Sequencing

Contributors

Supervisor:

Description

This dataset contains raw sequencing data, genome assemblies and derived analyses associated with the Master’s thesis:

Mundt, C.J.F. (2026). Phylogenetic Analysis and Species Delineation of Corynebacterium Isolates using Whole-Genome Sequencing.

The project implements a genome-based taxonomic framework for species delineation within the genus Corynebacterium using whole-genome sequencing data.

The dataset includes:

  • Raw Illumina paired-end reads

  • Read-level QC (FastQC)
  • De novo genome assemblies

  • Assembly quality assessment summaries (QUAST, CheckM2)

  • Genome annotations (Prokka)

  • Pairwise ANI matrices (FastANI)

  • Integrated digital DNA-DNA hybridization (dDDH) results (TYGS)

  • ANI-based cluster delineations

  • Core-genome SNP alignments and maximum-likelihood phylogenies (Snippy)

  • Core-gene phylogenomics (bcgTree)

  • Final phylogenetic visualizations

All analyses were performed in a Linux (WSL2) environment.

The computational workflow scripts are maintained in a separate version-controlled GitHub repository and are available upon reasonable request.

The dataset enables reproducibility of the genome-based taxonomic analyses presented in the thesis and may support further comparative genomic studies within Corynebacterium.

These data are associated with the genomic characterization of potentially novel Corynebacterium species.

Files

Restricted

The record is publicly accessible, but files are restricted. <a href="https://zenodo.org/account/settings/login?next=https://zenodo.org/records/18717365">Log in</a> to check if you have access.

Additional details

Dates

Available
2026-02-27