Published June 17, 2022 | Version 1
Dataset Open

An annotated high-content fluorescence microscopy dataset with Hoechst 33342-stained nuclei and manually labelled outlines

  • 1. Lund University, Faculty of Medicine, Cell Death, Lysosomes and Artificial Intelligence Group and AI Lund

Description

Here we present a benchmarking dataset of fluorescence microscopy images with Hoechst 33342-stained nuclei together with annotations of nuclei, nuclear fragments and micronuclei. Images were randomly selected from an RNA interference screen with a modified U2OS osteosarcoma cell line, acquired on a Thermo Fischer CX7 high-content imaging system at 20x magnification. Labelling was performed by a single annotator and reviewed by a biomedical expert.

The dataset contains 50 images showing over 2000 labelled nuclear objects in total, which is sufficiently large to train well-performing neural networks for instance or semantic segmentation. It is pre-split into training, development and test set, each in a zip file. The dataset should be referred to as Aitslab_bioimaging1. A brief article describing the dataset is also available (Arvidsson M, Kazemi Rashed S, Aits S. 10.1016/j.dib.2022.108769 )

Dataset description:

Fluorescence microscopy images: original .C01 files and files converted to 8-bit .png format (Grayscale)

Annotations: 24-bit .png format (RGB)

Script used to convert C01 to png images: C01_to_png.py file with python code and readme.md file with instructions to run it

Notes

We do have access to additional data that could be annotated. It can be made available upon request. A preprint describing the dataset is available on zenodo, https://doi.org/10.5281/zenodo.6677913 The final version of the article is published in Data in Brief, https://doi.org/10.1016/j.dib.2022.108769

Files

development_nuclei.zip

Files (117.3 MB)

Name Size Download all
md5:99c4f2481fd4fa023744e4abd9846d31
3.8 kB Download
md5:8a2f41fd07f7a244c68459f49a4e87be
23.1 MB Preview Download
md5:d559ccdc398c8b4038714100abda2026
801 Bytes Preview Download
md5:a12fccb5b461151e49d074a2c4138a05
23.4 MB Preview Download
md5:2a6735a75599424a5c2fe1c77bb750bc
70.8 MB Preview Download

Additional details

Related works

Is described by
Preprint: 10.5281/zenodo.6677913 (DOI)
Journal article: 10.1016/j.dib.2022.108769 (DOI)

References

  • Arvidsson M, Rashed SK, Aits S. An annotated high-content fluorescence microscopy dataset with Hoechst 33342-stained nuclei and manually labelled outlines (preprint). zenodo 2022, https://doi.org/10.5281/zenodo.6677913
  • Arvidsson M, Rashed SK, Aits S. An annotated high-content fluorescence microscopy dataset with Hoechst 33342-stained nuclei and manually labelled outlines. Data Brief. 2022 Nov 21;46:108769. doi: 10.1016/j.dib.2022.108769. eCollection 2023 Feb.