DeePaC models for novel fungal pathogens and real-time detection of multiple pathogen classes
- 1. Hasso Plattner Institute
Description
A collection of DeePaC ResNet models for
1) pathogenic potential prediction for novel fungal species (input Illumina read length: 250bp)
2) real-time detection of novel bacterial, viral and fungal pathogens (input Illumina read length: 25-250bp). Those models assume four classes: non-pathogens (i.e. commensal bacteria or non-human viruses), pathogenic bacteria, human-infecting viruses, and human-infecting fungi. Two alternative models are provided: we recommend either using the 'log' model (for faster inference), or an ensemble averaging predictions of both models (for better results).
See the code and manual at https://gitlab.com/dacs-hpi/deepac. Model weights for the fungal (-fun-) and multi-class (-multi4-) models in .h5 files and config .ini files.
The models were trained on read sets hosted here: https://zenodo.org/record/5713153 based on a curated database of pathogenic fungi (https://zenodo.org/record/5711852).
See also the preprint: https://www.biorxiv.org/content/10.1101/2021.11.30.470625v1
Files
Files
(149.9 MB)
Name | Size | Download all |
---|---|---|
md5:c28a9c57509fe1594e5c6931570891cf
|
50.0 MB | Download |
md5:fe22f697301be82a05507c9238068e8c
|
6.0 kB | Download |
md5:d7c4ef008668cfc660e8e7f22f1f6aa3
|
50.0 MB | Download |
md5:942b1bfa1bdef7c482b4109ce680d05b
|
6.0 kB | Download |
md5:accce11f11d14d423bc40e5da5bb9a66
|
50.0 MB | Download |
md5:9dc29af47db744fbc59655d147c95b9f
|
6.0 kB | Download |