Published April 6, 2023 | Version 1.0.0
Dataset Open

WormSwin: C. elegans Video Datasets

  • 1. University of Cologne

Contributors

  • 1. Rejuvenate Biomed
  • 2. Imperial College London

Description

Data used for our paper "WormSwin: Instance Segmentation of C. elegans using Vision Transformer".
This publication is divided into three parts:

  1. CSB-1 Dataset
  2. Synthetic Images Dataset
  3. MD Dataset

The CSB-1 Dataset consists of frames extracted from videos of Caenorhabditis elegans (C. elegans) annotated with binary masks. Each C. elegans is separately annotated, providing accurate annotations even for overlapping instances. All annotations are provided in binary mask format and as COCO Annotation JSON files (see COCO website).

The videos are named after the following pattern:

<"worm age in hours"_"mutation"_"irradiated (binary)"_"video index (zero based)">

For mutation the following values are possible: 

  1. wild type
  2. csb-1 mutant
  3. csb-1 with rescue mutation

An example video name would be 24_1_1_2 meaning it shows C. elegans with csb-1 mutation, being 24h old which got irradiated.

Video data was provided by M. Rieckher; Instance Segmentation Annotations were created under supervision of K. Bozek and M. Deserno.

The Synthetic Images Dataset was created by cutting out C. elegans (foreground objects) from the CSB-1 Dataset and placing them randomly on background images also taken from the CSB-1 Dataset. Foreground objects were flipped, rotated and slightly blurred before placed on the background images.
The same was done with the binary mask annotations taken from CSB-1 Dataset so that they match the foreground objects in the synthetic images. Additionally, we added rings of random color, size, thickness and position to the background images to simulate petri-dish edges.

This synthetic dataset was generated by M. Deserno.

The Mating Dataset (MD) consists of 450 grayscale image patches of 1,012 x 1,012 px showing C. elegans with high overlap, crawling on a petri-dish.
We took the patches from a 10 min. long video of size 3,036 x 3,036 px. The video was downsampled from 25 fps to 5 fps before selecting 50 random frames for annotating and patching.
Like the other datasets, worms were annotated with binary masks and annotations are provided as COCO Annotation JSON files.

The video data was provided by X.-L. Chu; Instance Segmentation Annotations were created under supervision of K. Bozek and M. Deserno.


Further details about the datasets can be found in our paper.

Notes

Maurice Deserno and Katarzyna Bozek were supported by the North Rhine-Westphalia return program (311-8.03.03.02-147635), BMBF program Junior Group Consortia in Systems Medicine (01ZX1917B) and hosted by the Center for Molecular Medicine Cologne.

Files

csb-1_dataset.zip

Files (10.6 GB)

Name Size Download all
md5:067ab107cb39eeaec7215644773f2eb8
2.9 GB Preview Download
md5:3310cc61abe99289e16eb8f76027f91f
197.3 MB Preview Download
md5:321dc82031f6ee8f97b68535aef51127
7.5 GB Preview Download

Additional details

Related works

Is published in
Journal article: 10.1038/s41598-023-38213-7 (DOI)
Preprint: 10.1101/2023.04.10.536324 (DOI)
Is supplemented by
Software: https://github.com/bozeklab/worm-swin/tree/main (URL)