Classification of blood cells dynamics with convolutional and recurrent neural networks: a sickle cell disease case study
Creators
- 1. ENS Lyon, Lyon, France
- 2. Aix Marseille Univ, CNRS, CINAM, Marseille, France
- 3. Marseille Medical Genetics (MMG), INSERM, Aix Marseille University, 13005Marseille, France
- 4. Sorbonne Universités, UPMC Univ Paris 06, CNRS, LIP6
- 5. Univ. Grenoble Alpes, Univ. Savoie Mont Blanc, CNRS, IRD, IFSTTAR, ISTerre,Grenoble, France
Description
The fraction of red blood cells (RBC) adopting a specific motion under low shear flow is a promising inexpensive marker for monitoring the clinical status of patients with sickle cell disease (SCD). Its high-throughput measurement relies on the video analysis of thousands of cell motions for each blood sample to eliminate a large majority of unreliable samples(out of focus or overlapping cells) and discriminate between tank-treading and flipping motion, characterizing highly and poorly deformable cells respectively. These videos are of different durations (from 6 to more than 100 frames).
This dataset contains four adult patients with SCD. They were enrolled in the study Drepaforme (approved by the institutional review board CPP Ouest 6 under the reference n°2018A00679-46) and were sampled weekly for several months. The movies were processed using in-house routines in Matlab (Matlab, R2016a) and RBC were detected individually and tracked over time. The database provided in this repository are already pre-processed sequences of tracked and centered RBC over time, each time step image being normalized to 31x31 pixels. Within the 32 experiments, the total number of sequences (or samples) is nearly 150 000. All sequences were semi-automatically labelled into 3 classes, depending on the dynamic of the cell: tank-treading, flipping and unreliable (140 000 are unreliable). The percentage of tank-treading cells with respect to all reliable cells (tank-treading+flipping) in every experiment is the final goal of this study.
This dataset is very interesting to the community as it is a large database for cell dynamics classification: the class depends on the movement of the cell.
An automatic processing of the database using a 2-stage deep learning model is available here https://github.com/icannos/redbloodcells_disease_classification
For opening the data in python:
from scipy.io import loadmat
x=loadmat('BG20191003shear10s01_Export.mat')
* x['Norm_Tab'] is of size nb_samples x max_len_sequences x 31 x 31, where max_len_sequences is the length of the longest sequence of the series, typically ~150 to 180. The other sequences are padded with 31x31 zero matrices at the end in order to fill this maximal length.
* x['Labels_Num'] is the corresponding label of each sequence, of size nb_samples. Label can be:
- 0 : "tank-treading" (or healthy)
- 1 : "flipping" (or tumbling, i.e. related to a SCD)
- 2 : "unreliable"
Files
Files
(4.1 GB)
Name | Size | Download all |
---|---|---|
md5:a204b8f9be9727b480ebe644ff0edb9e
|
61.3 MB | Download |
md5:42edb7698038654af95c1b54e31a367a
|
113.3 MB | Download |
md5:40bd2882dabbf218c412f1ff2a387b09
|
174.9 MB | Download |
md5:60bf46ce2c745c16bde7f617ec6678b9
|
166.0 MB | Download |
md5:92cb47d5dca5f266ffb8e00703e639ef
|
93.8 MB | Download |
md5:d9f6b0c9e980d9e5073ad462b81bf91b
|
99.1 MB | Download |
md5:920a4ea166c1099d74ea9df5739c3696
|
124.2 MB | Download |
md5:92ee79b675e7a031308e1438473cab67
|
226.1 MB | Download |
md5:685b4b44c8f8239d2d969e3b5a9898b8
|
242.0 MB | Download |
md5:24d52310bb800ddd102bec269dbc5e0f
|
234.5 MB | Download |
md5:9022150da92c97db7c4b442cdb653fc4
|
150.5 MB | Download |
md5:30e7cda054e2faed666a59298c34fbab
|
151.5 MB | Download |
md5:93416dfa2c29330101ea8e3fdf7d5d56
|
195.0 MB | Download |
md5:8bb9496385f8781295a32695450bb92b
|
208.8 MB | Download |
md5:a91b1dc6f2b91133ab6950afca93567e
|
81.5 MB | Download |
md5:412f5b01aad145f156947a41ea5d0949
|
83.0 MB | Download |
md5:4f36f0f3db50388b07b4b3c3ba424bb4
|
76.3 MB | Download |
md5:2980ca02c776b0f9a5d83b4a0167d2c6
|
63.7 MB | Download |
md5:b93347d900aefb23162e73e6248a6593
|
64.8 MB | Download |
md5:9ebe402fe24b7b603180f6269d7d4239
|
139.0 MB | Download |
md5:91c3fc04413fe5edf4771a2c33e460f3
|
143.3 MB | Download |
md5:23adb020e35309f5fd6c4e9a995d5d3c
|
70.8 MB | Download |
md5:639767b00fffa96d96b7352b9a5bc31a
|
134.1 MB | Download |
md5:403830517b7ebcb5fd47093cdaa1b7d5
|
159.2 MB | Download |
md5:3f64d575a89f2d720d4a0f84f4bcfcaf
|
89.3 MB | Download |
md5:930f4332e7afc8894c202f308e916b85
|
101.8 MB | Download |
md5:bedff21576b9f03ac33827668df5b1c4
|
90.4 MB | Download |
md5:d2f7cbb033901d09b9662cd30eedb983
|
111.8 MB | Download |
md5:40fd486c512ab78870bfcd4bd9a060aa
|
78.9 MB | Download |
md5:3e53efa6604cc4886e11b0d8f40eaf15
|
110.3 MB | Download |
md5:76129b051bd6c8b0d70e7338007a4090
|
112.3 MB | Download |
md5:fde70eb9165d969f2c8e0f0954445aca
|
117.3 MB | Download |