Published June 4, 2026 | Version v1

Listeria monocytogenes biofilm structure dataset [Part 2: strain characterisation, manual visual assessment of biofilm images, and inference on food extract-perturbed biofilms]

  • 1. ROR icon Jožef Stefan Institute
  • 2. Hungarian University of Agriculture and Life Sciences
  • 3. ROR icon University of Ljubljana

Description

This record is Part 2 of the data associated with the publication:
Janež, Škrlj, Osojnik et al. MicroICS: Extracting predictive information from structural features of Listeria monocytogenes biofilms for strain identification and biological context associations.

This part provides data supporting strain characterisation, classifier performance, and inference. It includes custom protein databases used for biomarker identification, metadata on the strains, visual inspection results that served as the baseline for algorithm performance comparison, and feature files and images used for inference on biofilms perturbed by food residues.

datafile_17_11_2023_3D_z_21_with_exp_ctrl_results – Feature calculation results for the training dataset 17_11_2023_3D_z_21_with_exp_ctrl (dataset 17_11_2023_3D_z_21 with experimental controls substituted). Used to train the random forest classifier and to perform inference on biofilm images with altered structure in E8_images_for_prediction.

E8_images_for_prediction – Images used for inference testing, comprising biofilms treated with food extracts and their corresponding controls.

metadata_on_Listeria_strains_used – Epidemiological, sequencing, and genomic analysis data for the strains used in this study.

biofilm_associated_proteins.fasta – Custom database of biofilm-associated proteins used with BLAST to identify homologues in the genomes of the selected strains.

wall_teichoic_acid_synthesis_associated_proteins.fasta – Custom database of wall teichoic acid synthesis-associated proteins used with BLAST to identify homologues in the genomes of the selected strains.

Prediction_set_shuffled, training_set – Selected images used for visual classification

visual_classification_results – Results of the visual classification of biofilm images conducted by three laboratory members.

Files

E8_images_for_prediction.zip

Files (15.8 GB)

Name Size
md5:53f72b9eecd8dccbff8fd8d89803b73e
5.8 kB Download
md5:c7b1156eeadd91c410f834f5a3aeb0a0
40.5 MB Download
md5:12486a320c670162cbc4697543e05a4d
13.3 GB Preview Download
md5:1289fa0282cff059585806cf515748e9
88.7 kB Download
md5:e60a11c4074a1ed1274fdb714738327c
1.3 GB Preview Download
md5:cf678d1f6e795043af7337dff462be9b
1.2 GB Preview Download
md5:e3eee6e12658a34ba217343f4073c8d6
66.7 kB Download
md5:203e1a3e8e22c8e23d0e6820a928c9cf
10.4 kB Download

Additional details

Funding

The Slovenian Research and Innovation Agency