Published October 29, 2025 | Version 1
Dataset Open

Dataset for publication: An inter-laboratory study characterizes the impact of bioinformatic approaches on genome-based cluster detection for foodborne bacterial pathogens

  • 1. Institute of Bacterial Infections and Zoonoses, Friedrich-Loeffler-Institute, Jena, Germany
  • 2. ROR icon Federal Institute for Risk Assessment

Description

This dataset is part of a dry-lab interlaboratory study conducted across Germany, regarding bacterial outbreak detection based on NGS data, with a focus on bioinformatic analysis of four species to identify potential variability caused by different data analysis approaches and human interpretation. Participants were asked to follow their usual in-house protocols while adhering to the general guidelines. A quality assessment (with sample exclusion) was followed by 7-gene Multilocus-Sequence Typing (MLST), core genome Multilocus Sequencing Typing (cgMLST), and SNP calling. The participants were then asked to identify clusters. The study was not intended to resemble a standard proficiency test with a passing/failing grade, but rather to investigate and quantify obvious variability in the results and, where possible, the reasons for it. For this purpose, the datasets included borderline cases in terms of quality.

Files

00_project_name_data_overview.CSV

Files (78.3 GB)

Name Size Download all
md5:4991944d202fcb6de7743d6546c138b6
759 Bytes Preview Download
md5:639f1f856ee79c6b96d3531f8653ea99
10.9 GB Preview Download
md5:43bfa19df4310b764cff09d3e32f6c4d
39.4 GB Preview Download
md5:fd320c697ce1d75debf0a7693c24211f
12.3 GB Preview Download
md5:3745668b814d08e770a8bdbc619f33c3
15.7 GB Preview Download