AI in Lung Health: Benchmarking Detection and Diagnostic Models Across Multiple CT Scan Datasets
Description
AI Model for Lung Cancer Detection and Classification
Benchmarking Across Multiple CT Scan Datasets
Description
This repository contains the model weights for the AI-driven lung cancer detection and classification models developed as part of the study "AI in Lung Health: Benchmarking Detection and Diagnostic Models Across Multiple CT Scan Datasets." The models were trained and validated using a diverse set of CT scan datasets, including DLCSD, LUNA16, and NLST, to ensure robustness and generalizability in lung nodule detection and classification.
Model Information
- Detection Models:
- DLCSD-mD: Trained on the Duke Lung Cancer Screening Dataset (DLCSD), designed to identify lung nodules within 3D CT volumes.
- LUNA16-mD: Trained on the LUNA16 dataset for lung nodule detection.
- Classification Models:
- ResNet50-SWS++: Enhanced ResNet50 using Strategic Warm-Start++ (SWS++) pretraining.
- Genesis & MedNet3D: Open-access self-supervised models.
- FMCB-based classifier: https://www.nature.com/articles/s42256-024-00807-9
Performance Summary
- Detection Task (AUC Scores):
- DLCSD-mD: 0.93 (DLCSD) | 0.97 (LUNA16) | 0.75 (NLST)
- LUNA16-mD: 0.96 (LUNA16) | 0.91 (DLCSD) | 0.71 (NLST)
- Classification Task (AUC Scores):
- ResNet50-SWS++: 0.71 (DLCSD) | 0.90 (LUNA16) | 0.81 (NLST)
Dataset Information
- DLCSD: 2,000+ CT scans, 1,613 patients, over 3,000 annotated nodules.
- LUNA16: 601 patients, 1,186 nodules.
- NLST: 969 patients, 1,192 nodules (annotations adapted from external sources).
Model Weights
The uploaded model weights correspond to the best-performing checkpoints based on validation loss and AUC performance. The weights are provided in PyTorch (.pth) format and can be directly used with the MONAI framework.
Code Repository
The full codebase, including training scripts, preprocessing pipelines, and evaluation metrics, is available at
Gitlab: https://gitlab.oit.duke.edu/cvit-public/ai_lung_health_benchmarking
GitHub: https://github.com/fitushar/AI-in-Lung-Health-Benchmarking-Detection-and-Diagnostic-Models-Across-Multiple-CT-Scan-Datasets/
Citation
If you use this model in your research, please cite:
Tushar et al., "AI in Lung Health: Benchmarking Detection and Diagnostic Models Across Multiple CT Scan Datasets," arXiv:2405.04605.
@article{tushar2024ai, title={AI in Lung Health: Benchmarking Detection and Diagnostic Models Across Multiple CT Scan Datasets}, author={Tushar, Fakrul Islam and Wang, Avivah and Dahal, Lavsen and Harowicz, Michael R and Lafata, Kyle J and Tailor, Tina D and Lo, Joseph Y}, journal={arXiv preprint arXiv:2405.04605}, year={2024} }
Files
Files
(801.0 MB)
Name | Size | Download all |
---|---|---|
md5:2b1898b5fd97c305489ebf70b3737949
|
83.8 MB | Download |
md5:9c2ac0b3d095a2fb95b07d1c820dcf07
|
185.0 MB | Download |
md5:34287210c2dac6053705ecb5a172d4f5
|
185.0 MB | Download |
md5:b50acd277a6530e242afb06b28ba8ea2
|
78.5 MB | Download |
md5:c04b1d371ffc44e4909d238e0b78cd75
|
185.0 MB | Download |
md5:79fc11dfce7cce713720e25093648c92
|
83.8 MB | Download |
Additional details
Related works
- Cites
- Preprint: arXiv:2405.04605 (arXiv)
Dates
- Updated
-
2025-03-04
Software
- Repository URL
- https://github.com/fitushar/AI-in-Lung-Health-Benchmarking-Detection-and-Diagnostic-Models-Across-Multiple-CT-Scan-Datasets
- Programming language
- Python
References
- Tushar, Fakrul Islam, et al. "AI in Lung Health: Benchmarking Detection and Diagnostic Models Across Multiple CT Scan Datasets." arXiv preprint arXiv:2405.04605 (2024).