Published January 16, 2022 | Version v1
Dataset Open

DRIAMS: Database of Resistance Information on Antimicrobials and MALDI-TOF Mass Spectra

  • 1. Swiss Federal Institute of Technology in Zurich
  • 2. University of Basel
  • 3. University Hospital of Basel

Description

Early administration of effective antimicrobial treatments is critical for the outcome of infections and the prevention of treatment resistance. Antimicrobial resistance testing enables the selection of optimal antibiotic treatments, but current culture-based techniques can take up to 72 hours to generate results. We have developed a novel machine learning approach to predict antimicrobial resistance directly from MALDI-TOF mass spectra profiles of clinical samples. We trained calibrated classifiers on a newly-created publicly available database of mass spectra profiles from clinically most relevant isolates with linked antimicrobial susceptibility phenotypes. The dataset combines more than 300,000 mass spectra with more than 750,000 antimicrobial resistance phenotypes from four medical institutions. Validation against a panel of clinically important pathogens, including Staphylococcus aureus, Escherichia coli, and Klebsiella pneumoniae, resulting in AUROC values of 0.80, 0.74, and 0.74 respectively, demonstrated the potential of using machine learning to substantially accelerate antimicrobial resistance determination and change of clinical management. Furthermore, a retrospective clinical case study found that implementation of this approach would have resulted in a beneficial change in the clinical treatment in 88% (8/9) of cases. MALDI-TOF mass spectra based machine learning may thus be an important new tool for treatment optimization and antibiotic stewardship.

Notes

We recommend using our Python package for MALDI-TOF preprocessing and machine learning analysis, maldi-learn (https://github.com/BorgwardtLab/maldi-learn), to load and analyse DRIAMS data.

The github package comes with an elaborate README file, which gives details on installation and usage examples. In order to use this package the locations of data files and folder structure must be preserved. Please note that all four downloaded data packages should be kept in one folder, serving as the DRIAMS root folder, which then needs to be set as the DRIAMS_ROOT path in the .env file.

The folder structure obtained after download is the following:

DRIAMS
├── DRIAMS-A
│   ├── binned_6000
│   ├── id
│   ├── preprocessed
│   └── raw
├── DRIAMS-B
│   ├── binned_6000
│   ├── id
│   ├── preprocessed
│   └── raw
├── DRIAMS-C
│   ├── binned_6000
│   ├── id
│   ├── preprocessed
│   └── raw
└── DRIAMS-D
    ├── binned_6000
    ├── id
    ├── preprocessed
    └── raw

Funding provided by: Alfried Krupp von Bohlen und Halbach-Stiftung
Crossref Funder Registry ID: http://dx.doi.org/10.13039/501100005306
Award Number: Alfried Krupp Prize for Young University Teachers

Funding provided by: D-BSSE-Uni-Basel Personalised Medicine grant*
Crossref Funder Registry ID:
Award Number: PMB-03-17

Files

README.md

Files (144.8 GB)

Name Size Download all
md5:ca32ee8893afd57f6e80f4d7f3a37cf1
86.2 GB Download
md5:86ef245c53b1a2cf547d04dee1dc79ea
3.7 GB Download
md5:b11fd90a8986bc679434ab9468759f9c
12.4 GB Download
md5:21483b6232da155edc324febe5208ee1
42.6 GB Download
md5:e4852b228e1e965649957d6f2cee2873
5.3 kB Preview Download

Additional details

Related works