Dataset Open Access

Robust Chest CT Image Segmentation of COVID-19 Lung Infection based on limited data

Dominik Müller; Iñaki Soto Rey; Frank Kramer

Background: The coronavirus disease 2019 (COVID-19) affects billions of lives around the world and has a significant impact on public healthcare. For quantitative assessment and disease monitoring medical imaging like computed tomography offers great potential as alternative to RT-PCR methods. For this reason, automated image segmentation is highly desired as clinical decision support. However, publicly available COVID-19 imaging data is limited which leads to overfitting of traditional approaches.

Methods: To address this problem, we propose an innovative automated segmentation pipeline for COVID-19 infected regions, which is able to handle small datasets by utilization as variant databases. Our method focuses on on-the-fly generation of unique and random image patches for training by performing several preprocessing methods and exploiting extensive data augmentation. For further reduction of the overfitting risk, we implemented a standard 3D U-Net architecture instead of new or computational complex neural network architectures.

Results: Through a k-fold cross-validation on 20 CT scans as training and validation of COVID-19, we were able to develop a highly accurate as well as robust segmentation model for lungs and COVID-19 infected regions without overfitting on limited data. We performed an in-detail analysis and discussion on the robustness of our pipeline through a sensitivity analysis based on the cross-validation and impact on model generalizability of applied preprocessing techniques. Our method achieved Dice similarity coefficients for COVID-19 infection between predicted and annotated segmentation from radiologists of 0.804 on validation and 0.661 on a separate testing set consisting of 100 patients.

Conclusions: We demonstrated that the proposed method outperforms related approaches, advances the state-of-the-art for COVID-19 segmentation and improves robust medical image analysis based on limited data.

The code and model are available under the following link:
https://github.com/frankkramer-lab/covid19.MIScnn

Code: https://github.com/frankkramer-lab/covid19.MIScnn
Files (3.2 GB)
Name Size
evaluation.zip
md5:74c65427c595986c41d74ed5b1ee9bfc
662.0 MB Download
models.zip
md5:20ac6f5186d7e98edbd54bc547d50ad6
2.5 GB Download
predictions.zip
md5:3fb40f3f9c05f224ec22421a7b9ad450
17.6 MB Download
supplementary.zip
md5:2a67cc8a134b7f89996e1321e4465402
4.3 MB Download
3,589
721
views
downloads
All versions This version
Views 3,589606
Downloads 72126
Data volume 925.2 GB19.4 GB
Unique views 3,101512
Unique downloads 35020

Share

Cite as