Beyond White-Nose Syndrome: A Multi-Scale Genomic Analysis of Pseudogymnoascus destructans
Creators
Description
Abstract
White-Nose Syndrome (WNS) has devastated insectivorous bat populations, particularly in North America, leading to severe ecological and economic consequences. Despite extensive research, many aspects of the evolutionary history, mitochondrial genome organization, and metabolic adaptations of its etiological agent, Pseudogymnoascus destructans, remain unexplored. Here, we present a multi-scale genomic analysis integrating pangenome reconstruction, phylogenetic inference, Bayesian divergence dating, comparative mitochondrial genomics, and refined functional annotation. We show that P. destructans exhibits extensive mitochondrial genome rearrangements absent in its nonpathogenic relatives from the class Leotiomycetes, suggesting a potential link between mitochondrial evolution and pathogenic adaptation. Our divergence dating analysis reveals that P. destructans separated from its Antarctic relatives approximately 141 million years ago, before adapting to bat hibernacula in the Northern Hemisphere. Additionally, our refined functional annotation significantly expands the known functional landscape of P. destructans, revealing an extensive repertoire of previously uncharacterized proteins involved in carbohydrate metabolism and secondary metabolite biosynthesis, key processes that likely contribute to its pathogenic success. By providing new insights into the genomic basis of P. destructans adaptation and pathogenicity, our study refines the evolutionary framework of this fungal pathogen and lays the foundation for future research on WNS mitigation strategies.
01_PanPhylo_analysis/
This directory contains all the files generated and analysed during the pangenome and phylogenomic analyses:
- pangenome — data from pangenome analysis:
- data — directory with the list of accession numbers of mitochondrial genomes to be analysed
- Annotation — pre-annotated mitochondrial genomes from the RefSeq database:
- Genes — directory with .fasta files of nucleotide sequences
- Proteins_classic — directory with .fasta files of amino-acid sequences
- Proteins — .fasta files with renamed aa seqs
- LSINFO-.lst — list file for input in PanACoTA
- fLSTINFO-.lst — filtered list file for extracting the shell pangenome
- Pangenome — pangenome built by PanACoTA with a strict protein identity parameter (i = 0.9)
- Coregenome — extracted shell genome (proteins persistent in 2/3 of analysed genomes)
- Alignment — output of PanACoTA's align (MAFFT) module, used to extract the sequences of the shell genome
- ... — a lot of log files
- MSAs — MSAs renamed so that each file name shows which gene family it contains
- trimmed_MSAs — trimAl's trimmed MSAs
- model-finder — ModelFinder log files on all the trimmed MSAs
- tree — final phylogenies constructed using the best substitution model on all the trimmed MSAs
- phylogenomics — data from phylogenomics analysis:
- Proteins_renamed; Proteins_renamed_r2; Proteins_renamed_r3 — directories with .fasta files of amino-acid sequences after successive rounds of renaming to fit Proteinortho input requirements
- protein_ortho_output — directory with all the output files of Proteinortho
- All; All_names — directories with technical data used to extract single-copy orthologs (SCOs)
- all_pep.fa — .fasta file combining all the mitochondrial proteomes, used to extract SCOs
- All_seqs; All_seqs_renamed — directories with .fasta files of SCOs
- MSAs — MSAs renamed so that each file name shows which gene family it contains
- trimmed_MSAs — trimAl's trimmed MSAs
- model-finder — ModelFinder log files on concatenated trimmed MSAs
- tree — final phylogenies constructed using the best substitution model on concatenated trimmed MSAs
- metadata — directory with GenBank metadata on the analysed mitochondrial genomes, fetched with Phyloki:
- raw_metadata.tsv — Phyloki's first results
- metadata.tsv — data with filtered Year column
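The shell-genome criterion mentioned for the Coregenome directory (proteins persistent in 2/3 of the analysed genomes) can be illustrated with a small sketch. This is not the PanACoTA implementation: the input structure (a dictionary from family ID to the genomes carrying it), the function name `shell_families`, and the choice of "at least 2/3" as the comparison are all assumptions made for illustration.

```python
from fractions import Fraction


def shell_families(families, n_genomes, min_fraction=Fraction(2, 3)):
    """Return sorted IDs of families present in at least min_fraction of genomes.

    families: dict mapping a family ID to an iterable of genome IDs carrying it
              (hypothetical structure, not a PanACoTA file format).
    n_genomes: total number of genomes analysed.
    """
    kept = []
    for fam_id, genomes in families.items():
        # Exact rational arithmetic avoids floating-point edge cases at 2/3.
        if Fraction(len(set(genomes)), n_genomes) >= min_fraction:
            kept.append(fam_id)
    return sorted(kept)


if __name__ == "__main__":
    # Toy presence data for three genomes (invented for the example).
    toy = {
        "fam1": {"g1", "g2", "g3"},
        "fam2": {"g1", "g2"},
        "fam3": {"g3"},
    }
    print(shell_families(toy, n_genomes=3))
```

With the toy data, `fam1` and `fam2` pass the threshold and `fam3` does not. The real analysis applies the threshold through PanACoTA using the filtered list file.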
02_Comparative_genomics/
This directory contains all the files generated and analysed during the comparative genomic analysis:
- data — directory with analysed genomes both in .fasta and .gb formats
- ANI — all the Average Nucleotide Identity analysis data:
- querylist.txt; reflist.txt — FastANI's inputs
- fastani.out; fastani.out.matrix — FastANI's outputs
- ANI.csv — data from the ANI heatmap
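As a rough illustration of how pairwise FastANI results can be turned into a square table suitable for a heatmap, the sketch below reads tab-separated lines whose first three columns are query, reference, and ANI value, then pivots them into rows. The function names and the toy input are hypothetical, and the exact layout of ANI.csv in this archive may differ.

```python
import csv
import io


def read_fastani(handle):
    """Parse tab-separated FastANI-style lines into {query: {reference: ani}}."""
    ani = {}
    for row in csv.reader(handle, delimiter="\t"):
        if len(row) < 3:
            continue  # skip blank or malformed lines
        query, ref, value = row[0], row[1], float(row[2])
        ani.setdefault(query, {})[ref] = value
    return ani


def to_matrix(ani):
    """Return a list of rows (header first) forming a square ANI table.

    Pairs missing from the input are left as empty strings.
    """
    names = sorted(set(ani) | {r for refs in ani.values() for r in refs})
    rows = [[""] + names]
    for q in names:
        rows.append([q] + [ani.get(q, {}).get(r, "") for r in names])
    return rows


if __name__ == "__main__":
    # Toy input invented for the example; real input would be fastani.out.
    toy = io.StringIO(
        "a.fasta\ta.fasta\t100.0\t10\t10\n"
        "a.fasta\tb.fasta\t98.5\t9\t10\n"
        "b.fasta\ta.fasta\t98.4\t9\t10\n"
        "b.fasta\tb.fasta\t100.0\t10\t10\n"
    )
    writer = csv.writer(io.StringIO())
    for row in to_matrix(read_fastani(toy)):
        print(row)
```

Empty cells are kept explicit because FastANI may not report a value for every genome pair, so a missing pair should not be silently read as zero.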
03_Dating/
This directory contains all the files generated and analysed during the Bayesian evolutionary analysis:
- data — directory with all the data generated by analysis:
- dating_super_tree.xml — BEAUti's generated BEAST file
- dating_super_tree.trees; dating_super_tree.ops; dating_super_tree.log — BEAST outputs
- dating_super_tree.tree — TreeAnnotator's annotated tree
- dating_super_tree_ready.tree — tree ready for visualization
- screenshots — screenshots of the parameters set in the GUI applications prior to running the analysis
04_Functional_annotation/
This directory contains all the files generated and analysed during the functional annotation analysis:
- data — directory with the initial files to be analysed:
- characterized.fasta — all the sequences available in RefSeq database by
'"Pseudogymnoascus destructans" AND Fungi NOT "uncharacterized" AND srcdb_refseq[PROP]' query
- uncharacterized.fasta — all the sequences available in RefSeq database by
'"Pseudogymnoascus destructans" AND Fungi AND "uncharacterized" AND srcdb_refseq[PROP]' query
- complete.fasta — merged .fasta file (characterized + uncharacterized)
- eggNOG — eggNOG-mapper annotations on all three profiles:
- characterized — annotations on characterized.fasta file
- characterized.emapper.annotations — main eggNOG-mapper's annotation file
- clean_characterized.emapper.annotations — eggNOG-mapper annotation file with duplicates removed
- characterized.emapper.seed_orthologs; characterized.emapper.genepred.fasta; characterized.emapper.genepred.gff; characterized.emapper.hits — other eggNOG-mapper annotation files
- characterized_cog_category_counts.tsv — count file with COG categories
- characterized_cog_category_counts_clean.tsv — processed count file with COG categories where multi-letter COG categories are treated like single-letter categories based on the 1st letter (e.g. KTN -> K)
- uncharacterized — annotations on uncharacterized.fasta file:
- Same as characterized
- complete — annotations on complete.fasta file:
- Same as characterized
- KEGGaNOG_data — data generated by running KEGGaNOG on characterized.emapper.annotations, uncharacterized.emapper.annotations, and complete.emapper.annotations (these data were generated just for fun; they are not mentioned in the paper, and no description is provided)
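The cleaning rule described for the *_cog_category_counts_clean.tsv files (a multi-letter COG category is counted under its first letter, e.g. KTN -> K) can be sketched as follows. The function names are hypothetical, and treating "-" or an empty string as "no category" is an assumption about how unassigned entries appear in the annotation files.

```python
from collections import Counter


def collapse_cog(category):
    """Reduce a COG category string to its first letter (e.g. 'KTN' -> 'K').

    Returns None for empty or '-' entries, assumed here to mean 'unassigned'.
    """
    category = category.strip()
    if not category or category == "-":
        return None
    return category[0]


def count_cog(categories):
    """Count collapsed COG letters over an iterable of category strings."""
    counts = Counter()
    for raw in categories:
        letter = collapse_cog(raw)
        if letter is not None:
            counts[letter] += 1
    return counts


if __name__ == "__main__":
    # Toy categories invented for the example.
    example = ["K", "KTN", "S", "-", "GM"]
    for letter, n in sorted(count_cog(example).items()):
        print(f"{letter}\t{n}")
```

Counting by first letter keeps each protein in exactly one category, at the cost of ignoring the secondary categories of multi-letter assignments.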
Files
01_PanPhylo_analysis.zip
Additional details
Dates
- Available: 2025-03-18 (first published version)