Phylogenetic Analysis and Species Delineation of Corynebacterium Isolates using Whole-Genome Sequencing
Authors/Creators
Contributors
Supervisor:
Description
This dataset contains raw sequencing data, genome assemblies and derived analyses associated with the Master’s thesis:
Mundt, C.J.F. (2026). Phylogenetic Analysis and Species Delineation of Corynebacterium Isolates using Whole-Genome Sequencing.
The project implements a genome-based taxonomic framework for species delineation within the genus Corynebacterium using whole-genome sequencing data.
The dataset includes:
-
Raw Illumina paired-end reads
- Read-level QC (FastQC)
-
De novo genome assemblies
-
Assembly quality assessment summaries (QUAST, CheckM2)
-
Genome annotations (Prokka)
-
Pairwise ANI matrices (FastANI)
-
Integrated digital DNA-DNA hybridization (dDDH) results (TYGS)
-
ANI-based cluster delineations
-
Core-genome SNP alignments and maximum-likelihood phylogenies (Snippy)
-
Core-gene phylogenomics (bcgTree)
-
Final phylogenetic visualizations
All analyses were performed in a Linux (WSL2) environment.
The computational workflow scripts are maintained in a separate version-controlled GitHub repository and are available upon reasonable request.
The dataset enables reproducibility of the genome-based taxonomic analyses presented in the thesis and may support further comparative genomic studies within Corynebacterium.
These data are associated with the genomic characterization of potentially novel Corynebacterium species.
Files
Additional details
Dates
- Available
-
2026-02-27