Published November 24, 2021 | Version 1.0
Dataset Open

Classification of blood cells dynamics with convolutional and recurrent neural networks: a sickle cell disease case study

  • 1. ENS Lyon, Lyon, France
  • 2. Aix Marseille Univ, CNRS, CINAM, Marseille, France
  • 3. Marseille Medical Genetics (MMG), INSERM, Aix Marseille University, 13005Marseille, France
  • 4. Sorbonne Universités, UPMC Univ Paris 06, CNRS, LIP6
  • 5. Univ. Grenoble Alpes, Univ. Savoie Mont Blanc, CNRS, IRD, IFSTTAR, ISTerre,Grenoble, France

Description

The fraction of red blood cells (RBC) adopting a specific motion under low shear flow is a promising inexpensive marker for monitoring the clinical status of patients with sickle cell disease (SCD). Its high-throughput measurement relies on the video analysis of thousands of cell motions for each blood sample to eliminate a large majority of unreliable samples(out of focus or overlapping cells) and discriminate between tank-treading and flipping motion, characterizing highly and poorly deformable cells respectively. These videos are of different durations (from 6 to more than 100 frames).

This dataset contains four adult patients with SCD. They were enrolled in the study Drepaforme (approved by the institutional review board CPP Ouest 6 under the reference n°2018A00679-46) and were sampled weekly for several months. The movies were processed using in-house routines in Matlab (Matlab, R2016a) and RBC were detected individually and tracked over time. The database provided in this repository are already pre-processed sequences of tracked and centered RBC over time, each time step image being normalized to 31x31 pixels. Within the 32 experiments, the total number of sequences (or samples) is nearly 150 000. All sequences were semi-automatically labelled into 3 classes, depending on the dynamic of the cell: tank-treading, flipping and unreliable (140 000 are unreliable). The percentage of tank-treading cells with respect to all reliable cells (tank-treading+flipping)  in every experiment is the final goal of this study.

This dataset is very interesting to the community as it is a large database for cell dynamics classification: the class depends on the movement of the cell.

An automatic processing of the database using a 2-stage deep learning model is available here https://github.com/icannos/redbloodcells_disease_classification

For opening the data in python:

  from scipy.io import loadmat
  x=loadmat('BG20191003shear10s01_Export.mat')

  * x['Norm_Tab'] is of size nb_samples x max_len_sequences x 31 x 31, where max_len_sequences is the length of the longest sequence of the series, typically ~150 to 180. The other sequences are padded with 31x31 zero matrices at the end in order to fill this maximal length.

  * x['Labels_Num'] is the corresponding label of each sequence, of size nb_samples. Label can be:
        - 0 : "tank-treading" (or healthy)
        - 1 : "flipping" (or tumbling, i.e. related to a SCD)
        - 2 : "unreliable"

Files

Files (4.1 GB)

Name Size Download all
md5:a204b8f9be9727b480ebe644ff0edb9e
61.3 MB Download
md5:42edb7698038654af95c1b54e31a367a
113.3 MB Download
md5:40bd2882dabbf218c412f1ff2a387b09
174.9 MB Download
md5:60bf46ce2c745c16bde7f617ec6678b9
166.0 MB Download
md5:92cb47d5dca5f266ffb8e00703e639ef
93.8 MB Download
md5:d9f6b0c9e980d9e5073ad462b81bf91b
99.1 MB Download
md5:920a4ea166c1099d74ea9df5739c3696
124.2 MB Download
md5:92ee79b675e7a031308e1438473cab67
226.1 MB Download
md5:685b4b44c8f8239d2d969e3b5a9898b8
242.0 MB Download
md5:24d52310bb800ddd102bec269dbc5e0f
234.5 MB Download
md5:9022150da92c97db7c4b442cdb653fc4
150.5 MB Download
md5:30e7cda054e2faed666a59298c34fbab
151.5 MB Download
md5:93416dfa2c29330101ea8e3fdf7d5d56
195.0 MB Download
md5:8bb9496385f8781295a32695450bb92b
208.8 MB Download
md5:a91b1dc6f2b91133ab6950afca93567e
81.5 MB Download
md5:412f5b01aad145f156947a41ea5d0949
83.0 MB Download
md5:4f36f0f3db50388b07b4b3c3ba424bb4
76.3 MB Download
md5:2980ca02c776b0f9a5d83b4a0167d2c6
63.7 MB Download
md5:b93347d900aefb23162e73e6248a6593
64.8 MB Download
md5:9ebe402fe24b7b603180f6269d7d4239
139.0 MB Download
md5:91c3fc04413fe5edf4771a2c33e460f3
143.3 MB Download
md5:23adb020e35309f5fd6c4e9a995d5d3c
70.8 MB Download
md5:639767b00fffa96d96b7352b9a5bc31a
134.1 MB Download
md5:403830517b7ebcb5fd47093cdaa1b7d5
159.2 MB Download
md5:3f64d575a89f2d720d4a0f84f4bcfcaf
89.3 MB Download
md5:930f4332e7afc8894c202f308e916b85
101.8 MB Download
md5:bedff21576b9f03ac33827668df5b1c4
90.4 MB Download
md5:d2f7cbb033901d09b9662cd30eedb983
111.8 MB Download
md5:40fd486c512ab78870bfcd4bd9a060aa
78.9 MB Download
md5:3e53efa6604cc4886e11b0d8f40eaf15
110.3 MB Download
md5:76129b051bd6c8b0d70e7338007a4090
112.3 MB Download
md5:fde70eb9165d969f2c8e0f0954445aca
117.3 MB Download