Published September 26, 2023 | Version 1.0
Dataset Open

Data from: Automatic patient-level recognition of four Plasmodium species on thin blood smear by a Real Time Detector Transformer (RT-DETR) object detection algorithm: a proof-of-concept and evaluation

  • 1. Department of Parasitology/Mycology, Academic Hospital (CHU) of Toulouse, Toulouse, France.
  • 2. Univ Rouen Normandie, Laboratory of Parasitology-Mycology, EA7510 ESCAPE, University hospital of Rouen, Normandie, France
  • 3. Université de Paris Cité, Laboratoire de Parasitologie-Mycologie, Groupe Hospitalier Saint-Louis-Lariboisière-Fernand-Widal, Assistance Publique-Hôpitaux de Paris (AP-HP), Paris, France.
  • 4. Laboratory of Parasitology-Mycology, INSERM U1285, Unité de Glycobiologie Structurale et Fonctionnelle (CNRS UMR 8576), University Hospital (CHU) of Lille, University of Lille, Lille, France.
  • 5. Nantes University, Academic Hospital (CHU) of Nantes, Cibles et Médicaments des Infections et de l'Immunité, IICiMed, UR1155, Nantes, France.
  • 6. University of Montpellier, Department of Physical Chemistry and Biophysics, Academic Hospital (CHU) of Montpellier, Department of Parasitology/Mycology, National Reference Centre (CNR) for Paludism, Montpellier, France.
  • 7. Laboratory of Parasitology/Mycology, UMI 233 TransVIHMI, University of Montpellier, IRD, INSERM U1175
  • 8. Univ Rouen Normandie, Laboratory of Parasitology-Mycology, EA7510 ESCAPE, University hospital of Rouen, Normandie, France.
  • 9. University of Montpellier, Academic Hospital (CHU) of Montpellier, Department of Parasitology/Mycology, National Reference Centre (CNR) for Paludism, Montpellier, France.

Description

Automatic patient-level recognition of four Plasmodium species on thin blood smear by a Real Time Dectector Transformer (RT-DETR) object detection algorithm: a proof-of-concept and evaluation

Emilie Guemas, Baptiste Routier, Théo Ghelfenstein-Ferreira, Camille Cordier, Sophie Hartuis, Bénédicte Marion, Sébastien Bertout, Emmanuelle Varlet-Marie, Damien Costa, Grégoire Pasquier

Abstract:

Malaria remains a global health problem with 247 million cases and 619,000 deaths in 2021. Diagnostic of Plasmodium species is important for administering the appropriate treatment. The gold-standard diagnosis from accurate species identification remains the thin blood smear. Nevertheless, this method is time-consuming and requires highly skilled and trained microscopists. To overcome these issues, new diagnostic tools based on deep learning are emerging. This study aimed to evaluate the performances of a RT-DETR (Real-Time Detection Transformer)object detection algorithm to discriminate Plasmodium species on thin blood smears images. The algorithm was trained and validated on a dataset consisting in 24,720 images from 475 thin blood smears corresponding to 2,002,597 labels. Performances were calculated with a test dataset of 4,508 images from 170 smears corresponding to 358,825labels coming from six French university hospital. At the patient level, the RT-DETR algorithm exhibited an overall accuracy of 79.4% (135/170) with a recall of 74% (40/54) and 81.9% (95/116) for negative and positive smears, respectively. Among Plasmodium positive smears, the global sensitivity was 82.7% (91/110) with a sensitivity of 90% (38/42), 81.8% (18/22) and 76.1% (35/46) for P. falciparum, P. malariae and P. ovale/vivax, respectively. The YOLOv5 model achieved a World Health Organization (WHO) competence level 2 for species identification. Besides, the RT-DETR algorithm may be run in real-time on low-cost devices such as a smartphone and could be suitable for deployment in low-resource setting areas where microscopy experts are lacking.

Data collection:

The training and validation dataset included 24,720 pictures taken from 475 manually May Grunwald-Giemsa (MGG)-stained thin blood smears from the Montpellier University Hospital collection and for a smaller part from the Toulouse University Hospital collection. In Montpellier, the pictures were taken with a Flexcam C1 microscope camera (Leica) attached to a Leica DM 2000 microscope and Leica DF450C microscope camera adapted with a Leica DM2500 microscope at X1000 magnification. Labelling of pictures was performed manually, and then automatically with manual correction with a Computer Visual Annotation Tools (CVAT) free software. Nine categories of labels were used: white blood cells (n=3,338), red blood cells (n=1,887,781), platelets (n=48,520), Trypanosoma brucei (n=2,773), and red blood cells infected by P. falciparum (n=43,545), P. ovale (n=4,651), P. vivax (n=4,115), P. malariae (n=2,849) and Babesia divergens (n=5,142).

The test dataset included 4,508 pictures taken from 170 thin blood smears from the same number of patients from the Parasitology laboratories of University Hospitals of Montpellier, Toulouse, Rouen, Lille, Nantes and Saint-Louis in Paris (Table 1). Among these 170 patients, 54 were not infected, including two patients with Howell-Jolly bodies, and 116 were infected with hematozoa. For each patient, between 20 and 30 photos were taken from one thin blood smear with at least one hematozoan parasite per picture for infected patients.

Accurate species diagnostic was made by a senior parasitologist, and for recent smears, it was confirmed by specific PCR, either performed locally (Toulouse) or at the Malaria French National Reference Center (Montpellier, Saint Louis, Rouen, Lille, Nantes).

Files

models.zip

Files (44.8 GB)

Name Size Download all
md5:223f32ecb572016828cff3b361ee2522
411.6 MB Preview Download
md5:de5ed551ac2162c2574b13c6b25d7444
10.5 GB Download
md5:76cbc00b44810cf3c02251636cb94331
10.5 GB Download
md5:8702a650475d13a2c7a301499f1184a3
10.5 GB Download
md5:06b97fd08051a9ce831bcf4e0860be06
10.5 GB Download
md5:9c7337d7e7471d5e1bd84988396cecb8
2.5 GB Download