There is a newer version of the record available.

Published February 14, 2025 | Version v1
Dataset Open

Clinical Dataset Of Bronchoalveolar Lavage Fluid Cell

Authors/Creators

Description

This dataset comprises BALF cell images stained by Diff-quick staining and Gram staining, from patients who underwent bronchoalveolar lavage and endotracheal aspirates between 2018 and 2024 at the Chinese PLA General Hospital. The dataset contained 2105 images, with 13,263 annotated cells from seven typical cell classes, including erythrocyte, ciliated columnar epithelial, squamous epithelial, macrophage, lymphocyte, neutrophil, and eosinophil cells using both contour fine labeling and bounding box labeling. All cells that could be confidently identified by senior clinical cytologists were annotated. A small number of cells, for which consensus could not be reached among the cytologists, were left unannotated due to uncertainty.

In addition, the classical YoloV8 deep learning instance segmentation model was used to detect and segment the seven types of BALF cells. As observed, our model demonstrated exceptionally high accuracy. There are four types of the dataset: High Resolution Images, Images,  Visualization Images, and Labels, which are helpful to promote the study of automated cell identification in BALF.

  • High Resolution Images: a total of 1903 original images (4,912×3,684 pixels) from clinical samples collected by an ImageView optical microscope (202 images were missed because of the initial anonymous input, but were all included in Images). 
  • Images: 2105 images from High Resolution Images resampled to the size of 853×640 for bounding box annotation and pixel-level high quality annotation, then applied directedly for network training. 
  • Visualization Images: images that showed manually labeled contours on each cell.
  • Labels: saved in a txt format for direct use of YOLO series models and a convenient way to convert to other data formats.  

Files

Files (24.4 GB)

Name Size Download all
md5:1d0fa6a6ed2b20da9c37d2d35742f635
3.6 GB Download
md5:d721c4568389c0f929f69a9b45136024
3.8 GB Download
md5:1e43f8908c17e8bf1613e6373dc588d0
5.1 GB Download
md5:262abc9fcfa072db215d89ce2e769711
3.3 GB Download
md5:afb32d7096604bffd4b7658bae50f239
5.8 GB Download
md5:7d0d4e9566bf2ee26b54075bead0b1d8
2.5 GB Download
md5:cb463bfaa959f36c1cd12f402eff647c
118.7 MB Download
md5:b9ef376f4191e727955502198e174608
4.6 MB Download
md5:d5b942051f9f9738fbb16ab76215c1fa
238.8 MB Download