Published January 13, 2022 | Version v1.2
Dataset Open

A curated database of fungal pathogens and their host range

Description

This database contains a manually curated set of human, animal and plant pathogens, annotated with their confirmed host range and the relevant sources. We also include additional sets of plant-associated fungi (which may include non-pathogens), as well as fungi with an automatically assigned, putative human, animal or plant host. The labelled fungal species are linked to their representative GenBank genomes wherever possible. Genomes that were screened but for which no label was found are also included.

[Last update on: 11 Dec 2022]
[Home page: https://dacs-hpi.gitlab.io/pathogenic-fungi/]

The database is stored in a flat-file format. All metadata are stored in all_data_[date].csv, and all_data_[date].rds contains the same data in a compressed format that can be easily loaded in R. The database was first compiled on 9 Oct 2021 (v1.0), and then updated on 2 Jan 2022 (v1.1) and 11 Dec 2022 (v1.2).
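As the metadata are a plain CSV file, they can also be read outside R. The following is a minimal sketch using only the Python standard library. It assumes the file is UTF-8 encoded with a single header row, which is not stated in this record; the helper names `load_records` and `column_names` are hypothetical, and no column names are assumed.

```python
import csv
from pathlib import Path


def load_records(path):
    """Read a flat CSV file into a list of dicts keyed by the header row."""
    # Assumption: UTF-8 encoding and one header row (not confirmed by the record).
    with Path(path).open(newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))


def column_names(path):
    """Return only the header row, without reading the whole file."""
    with Path(path).open(newline="", encoding="utf-8") as handle:
        return next(csv.reader(handle), [])


if __name__ == "__main__":
    # File name as it appears in this record's file listing; adjust the
    # date stamp to match the version you downloaded.
    csv_path = Path("all_data_2022-01-02.csv")
    if csv_path.exists():
        print(column_names(csv_path))
        print(len(load_records(csv_path)), "rows")
```

Inspecting the header first is a reasonable way to find out which columns hold the species, host labels and genome accessions before filtering the records.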

The core database is limited to manually confirmed human, animal and plant pathogens with available genomes as of 9 Oct 2021. Those data are a subset of all_data, and are stored in core_fungal_pathogens.csv and core_fungal_pathogens.rds.

The temporal-test subset contains confirmed pathogens with genomes added to GenBank between 9 Oct 2021 and 2 Jan 2022.

You may also be interested in the trained neural network models that predict the pathogenic potential of novel fungi from DNA sequences (https://zenodo.org/record/5711877) and in the simulated Illumina read sets used to train them (https://zenodo.org/record/5846397).

See also the preprint: https://www.biorxiv.org/content/10.1101/2021.11.30.470625 and the paper presented at ECCB '22 and published in Bioinformatics: https://doi.org/10.1093/bioinformatics/btac495.

Files

all_data_2022-01-02.csv

Files (16.7 MB)

MD5 checksum                           Size
md5:8753f41da481801902642dcd62ca00f8   6.8 MB
md5:ce2bd1e5dde3f2d90e1ae1e06255a72d   623.7 kB
md5:31d4577726b5afc1a363302a5e994e95   7.1 MB
md5:ce68cf5bf7d54c7adaa477a009ed1109   677.8 kB
md5:fcc1b349defa65bc03f4bf1b295001cd   1.1 MB
md5:dc728c57e8f88978d2b5d4b558d478ff   322.5 kB
md5:f7f306864ac0b2b9d02446cf7da573fe   17.7 kB
md5:4555b24be7bdcc942dbd7ec0454bebb2   2.7 kB
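
The MD5 checksums listed above can be used to confirm that a download is intact. The following is a minimal sketch using Python's `hashlib`; the helper names `md5_of_file` and `matches_checksum` are hypothetical. The listing here does not show which checksum belongs to which file, so the expected value has to be paired with the file name on the record page.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file's MD5 digest with a value such as 'md5:8753f4...'."""
    expected = expected.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

For example, `matches_checksum("all_data_2022-01-02.csv", "md5:...")` would return `True` only if the local file is byte-for-byte identical to the one that produced the checksum passed in.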