Published March 13, 2020 | Version 1
Dataset Open

MS data set: Identification of Microorganisms by Liquid Chromatography-Mass Spectrometry (LC-MS1) and in silico Peptide Mass Data

  • 1. Robert Koch-Institute, Centre for Biological Threats and Special Pathogens, Proteomics and Spectroscopy (ZBS6), Berlin, Germany

Description

Data set consisting of raw LC-MS2 data, LC-MS1 peak data and a description

For unreviewed publication preprint: Identification of Microorganisms by Liquid Chromatography-Mass Spectrometry (LC-MS1) and in silico Peptide Mass Data

ABSTRACT

Over the past decade, modern methods of mass spectrometry (MS) have emerged that allow reliable, fast and cost-effective identification of pathogenic microorganisms. While MALDI-TOF MS has already revolutionized the way microorganisms are identified, recent years have also witnessed substantial progress in the development of liquid chromatography (LC)-MS based proteomics for microbiological applications. For example, LC-tandem mass spectrometry (LC-MS2) has been proposed for microbial characterization by means of multiple discriminative peptides that enable identification at the species level, or sometimes at the strain level. However, such investigations can be very time-consuming, especially if the experimental LC-MS2 data are tested against sequence databases covering a broad panel of different microbiological taxa.

In this proof-of-concept study, we present an alternative bottom-up proteomics method for microbial identification. The proposed approach involves efficient extraction of proteins from cultivated microbial cells, digestion by trypsin and LC-MS measurements. MS1 data are then extracted and systematically tested against an in silico library of peptide mass data compiled in-house. The library has been computed from the UniProt Knowledgebase Swiss-Prot and TrEMBL databases and comprises more than 12,000 strain-specific in silico profiles, each containing tens of thousands of peptide mass entries. Identification analysis involves computation of score values derived from spectral distances between experimental and in silico peptide mass data and compilation of score ranking lists. The taxonomic positions of the microbial samples are then determined by using the best-matching database entries. The suggested method is computationally efficient (less than two minutes per sample) and has been successfully tested on a set of 19 different microbial pathogens. The approach is rapid, accurate and automatable and holds great potential for future microbiological applications.
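The workflow described above (in silico tryptic digestion, peptide mass profiles, scoring, ranking) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the minimum peptide length, the 5 ppm tolerance and the simple matched-fraction score are all assumptions made for illustration, whereas the abstract only states that scores are derived from spectral distances. The sketch also assumes that experimental MS1 peaks have already been converted to neutral monoisotopic masses.

```python
import re
from bisect import bisect_left

# Standard monoisotopic residue masses (Da) of the 20 common amino acids.
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056  # mass of H2O added to the residue sum of a free peptide


def tryptic_peptides(sequence, min_len=6):
    """Cleave after K or R unless the next residue is P (no missed cleavages)."""
    pieces = re.split(r"(?<=[KR])(?!P)", sequence)
    return [p for p in pieces if len(p) >= min_len]


def peptide_mass(peptide):
    """Neutral monoisotopic mass of an unmodified peptide."""
    return sum(MONO[residue] for residue in peptide) + WATER


def mass_profile(proteins, min_len=6):
    """Sorted, de-duplicated peptide masses for one (hypothetical) strain."""
    masses = set()
    for sequence in proteins:
        for peptide in tryptic_peptides(sequence, min_len):
            if set(peptide) <= MONO.keys():  # skip non-standard residues
                masses.add(round(peptide_mass(peptide), 4))
    return sorted(masses)


def match_score(experimental, profile, ppm=5.0):
    """Fraction of experimental masses found in the profile within +/- ppm.

    Illustrative stand-in only; not the distance-based score of the preprint.
    """
    if not experimental:
        return 0.0
    hits = 0
    for mass in experimental:
        tolerance = mass * ppm * 1e-6
        i = bisect_left(profile, mass - tolerance)
        if i < len(profile) and profile[i] <= mass + tolerance:
            hits += 1
    return hits / len(experimental)


def rank(experimental, library, ppm=5.0):
    """Score every library entry and return (name, score), best first."""
    scores = [(name, match_score(experimental, profile, ppm))
              for name, profile in library.items()]
    return sorted(scores, key=lambda item: item[1], reverse=True)
```

In use, `library` would map a strain name to the profile returned by `mass_profile` for that strain's protein sequences, and the first entry returned by `rank` would be taken as the best-matching database entry.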

For details see the following preprint: Lasch, P., Schneider, A., Blumenscheit, C. and Doellinger, J. “Identification of Microorganisms by Liquid Chromatography-Mass Spectrometry (LC-MS1) and in silico Peptide Mass Data”. bioRxiv preprint, http://dx.doi.org/10.1101/870089

 

Notes

License type for database files (spectra): Creative Commons Attribution Non Commercial 4.0 International (CC-BY-NC). Licensees must credit the original authors by stating their names and the original work's title. Licensees may copy, distribute, display, and perform the work and make derivative works and remixes based on it only for non-commercial purposes.

Files

_Overview_of_Tandem_MS_data_files.pdf

Files (46.6 GB)

Checksum                                Size
md5:f009ee24923a1cffb96624568bf2c221    1.2 GB
md5:26013f9d8edf5d06fd4fa46dba1cd6d7    1.3 GB
md5:aa303b281a90804d40439c2d44ee1f73    1.1 GB
md5:c785aa8514749daf96c0127e9020254c    1.1 GB
md5:501ecc1d9624ea36439e1f70dee9b593    1.3 GB
md5:48db419b15528d15b45634f6167c372e    1.2 GB
md5:fbe80cc02b8e8985aab05dba67a3fab4    1.2 GB
md5:0679e77bc47ac2a00e6ab40f54c3f8b2    1.2 GB
md5:818111e303ca562fc7116efa94188f7f    1.1 GB
md5:7b0feee8df98b3cad081b9eef7b51c40    1.1 GB
md5:8ddfa049c851ad36f73d5387454feed5    1.2 GB
md5:5aae27f634664410a86c00988d6af43a    1.2 GB
md5:bd3e2dbe7d411c55ad308d0dbff774f3    1.2 GB
md5:f2389bcc343f869d8ee4a03cbe2b09bd    1.2 GB
md5:030ba7c6d3758e8d8de0746c26a36790    1.1 GB
md5:6e911600f20c147ad9c9d9df693b30ee    1.1 GB
md5:0e9ec17f909d832d852a31ab4fff7a35    1.2 GB
md5:f4be0ac452d5984db7b3590cb02835c3    1.2 GB
md5:45e218b59579bfeb556f68f9fed227bd    1.2 GB
md5:8ae357197e83563f347780faeb5aba7f    1.2 GB
md5:1c576b2ff05386cdf8a1be4f758654d5    1.2 GB
md5:21b82fe78468c0138495922550f20764    1.2 GB
md5:f420b755889063da64a2ada3187f6654    1.1 GB
md5:2ea7223d395eb2def6ed8822f2b811ec    1.1 GB
md5:dac063ba537e6791a48cd59203a781a6    1.1 GB
md5:5e12849dacc71750a8e111f559096e37    1.1 GB
md5:b1706bc4619b40eb4a6844fd770e0d5e    1.1 GB
md5:6a39389f78e386af353fa3415db22a59    1.1 GB
md5:846db3cfa1db818cb986a993929f8e63    1.4 GB
md5:7a7b17bee16baf7d727426f999b0d5e1    1.4 GB
md5:3ec3f49b4a71f078f13c392f9a9cc6ff    1.4 GB
md5:ed5f092b28c166efe914e3b1c5cfd6af    1.3 GB
md5:63d3145dafbc8f82d885c48aa801f64a    1.4 GB
md5:dd29ad5b2195aeeab4666201c876c472    1.4 GB
md5:8283740998ca2076865a323426ff016d    1.1 GB
md5:7b5a45fd8832caaccc4491df96282c04    1.1 GB
md5:9f34945dd93577b1652b50e945d09686    1.1 GB
md5:db3e55f86dd8c9310a1d6f5a958fd46c    1.3 GB
md5:15be534fe825a6d62b842f6326e77655    1.5 GB
md5:d207a0a7d7daaf6b355d14198df34030    212.4 kB
md5:9b0664ef9b4d369fef7e1c0102397335    80.1 MB
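
Because the individual files are large, it may be worth checking each download against its listed MD5 checksum. The following is a minimal sketch using Python's standard `hashlib` module; the function names and the chunk size are illustrative assumptions, and the path passed in is whatever local file name the download was saved under.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns an empty bytes object
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed):
    """Compare a file against a checksum given as 'md5:<hex>' or plain hex."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[4:]
    return md5_of_file(path) == expected
```

Reading in chunks keeps memory use constant, which matters for files in the gigabyte range. A call such as `matches_listed_checksum(local_path, "md5:...")` returns `True` only if the file on disk is byte-identical to the published one.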

Additional details

Related works

Is supplement to
Preprint: 10.1101/870089 (DOI)
Is supplemented by
Other: 10.5281/zenodo.3573996 (DOI)