Published May 13, 2025 | Version v1
Dataset Open

Materials for Benchmarking Spectral Library and Database Search Approaches for Metaproteomics Using a Ground-Truth Microbiome Dataset

  • 1. ROR icon University of Minnesota System
  • 2. ROR icon University of Minnesota

Contributors

Contact person:

Data manager:

Description

Mass spectrometry-based metaproteomics, the identification and quantification of thousands of proteins expressed by complex microbial communities, has become pivotal for unraveling functional interactions within microbiomes. However, metaproteomics data analysis encounters many challenges, including the search of tandem mass spectra against a protein sequence database using proteomics database search algorithms. We used a ground-truth dataset to assess a spectral library searching method against established database searching approaches. Mass spectrometry data collected by data-dependent acquisition (DDA-MS) was analyzed using database searching approaches (MaxQuant and FragPipe), as well as using Scribe with Prosit predicted spectral libraries. We used FASTA databases that included protein sequences from microbial species present in the ground-truth dataset along with background protein sequences, to estimate error rates and assess the effects on detection, peptide-spectral match quality, and quantification. Using the Scribe search engine resulted in more proteins detected at a 1% false discovery rate (FDR) compared to MaxQuant or FragPipe, while FragPipe detected more peptides verified by PepQuery. Scribe was able to detect more low-abundance proteins in the microbiome dataset and was more accurate in quantifying the microbial community composition. This research provides insights and guidance for metaproteomics researchers aiming to optimize results in their analysis of DDA-MS data.

Files

1_MASS_SPECTROMETRY_FILES.zip

Files (31.2 GB)

Name Size Download all
md5:e6a667803d61dd76b186e0287cd6ebd6
9.1 GB Preview Download
md5:78145b4867f6c46387db3d560bcc68f7
22.0 GB Preview Download
md5:0b4b0aaddc7903779bdce5da9223073c
96.2 MB Preview Download
md5:a5987964e7af9dda2189296a6865b04c
11.3 MB Preview Download
md5:f296145230fd70e5ede842bbde931ae0
4.3 MB Preview Download
md5:52a1ceed6bb36026931d6933e984c7ea
13.0 MB Preview Download
md5:afdccdd5706322752b12cceeaf15375a
6.7 MB Preview Download
md5:5c7af3107e30de132a259a5da2979151
2.8 MB Preview Download
md5:3adcd6a90d6efdf0c7ead872ddebbe2d
16.9 MB Preview Download

Additional details

Dates

Created
2025