Classification of Multiple Diseases on Body CT Scans Using Weakly Supervised Deep Learning: Model Weights
Description
Model Documentation: Multidisease Classification Models for Body CT Scans
This document provides an overview and usage guidance for three deep learning models developed to perform multidisease classification on body CT scans. The models are based on 3D convolutional neural networks implemented in Python using TensorFlow, and they were trained using weak supervision derived from radiology report text.
Background and Purpose
These models were developed as part of a retrospective study aiming to detect multiple common disease conditions across three major organ systems—lungs and pleura, liver and gallbladder, and kidneys and ureters—using body CT scans. Labels for training were extracted using rule-based natural language processing (NLP) from radiology reports, enabling efficient training without extensive manual annotation.
The work demonstrates how weak supervision can support the development of clinically relevant, multi-organ disease classifiers on a large scale.
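As a rough illustration of what rule-based label extraction from report text can look like, the sketch below matches keyword patterns for each lung label and skips a mention when a simple negation cue precedes it in the same sentence. The keyword patterns, the negation cues, the function name `extract_labels`, and the way the "no apparent disease" flag is derived are all illustrative assumptions. They are not the rules used in the study.

```python
import re

# Hypothetical keyword patterns for the lungs and pleura labels (illustrative only).
LUNG_PATTERNS = {
    "atelectasis": r"\batelecta(?:sis|tic)\b",
    "nodule": r"\bnodules?\b",
    "emphysema": r"\bemphysema(?:tous)?\b",
    "effusion": r"\beffusions?\b",
}

# Simple negation cues; a production rule set would need to be far richer.
NEGATION = re.compile(r"\b(?:no|without|negative for|absence of)\b")


def extract_labels(report):
    """Return a 0/1 flag per label, plus an assumed 'no_apparent_disease' flag."""
    labels = {name: 0 for name in LUNG_PATTERNS}
    # Split into sentences so that a negation only affects nearby mentions.
    for sentence in re.split(r"[.;\n]+", report.lower()):
        for name, pattern in LUNG_PATTERNS.items():
            match = re.search(pattern, sentence)
            if match is None:
                continue
            # Skip the mention if a negation cue appears before it.
            if NEGATION.search(sentence[: match.start()]):
                continue
            labels[name] = 1
    # Assumption: "no apparent disease" means none of the four labels fired.
    labels["no_apparent_disease"] = int(not any(labels.values()))
    return labels
```

For example, `extract_labels("Small right pleural effusion. No pulmonary nodules.")` flags effusion but not nodule, because the nodule mention follows a negation cue.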
Model Summary
Each model targets one organ system and predicts the presence or absence of five labels (four pathologies plus a "No Apparent Disease" class):
- Lungs and Pleura
  - Labels: Atelectasis, Nodule, Emphysema, Effusion, No Apparent Disease
  - Performance (AUCs):
    - Atelectasis: 0.77
    - Nodule: 0.65
    - Emphysema: 0.89
    - Effusion: 0.97
    - No Apparent Disease: 0.89
- Liver and Gallbladder
  - Labels: Hepatobiliary Calcification, Lesion, Dilation, Fatty Liver, No Apparent Disease
  - Performance (AUCs):
    - Calcification: 0.62
    - Lesion: 0.73
    - Dilation: 0.87
    - Fatty: 0.89
    - No Apparent Disease: 0.82
- Kidneys and Ureters
  - Labels: Stone, Atrophy, Lesion, Cyst, No Apparent Disease
  - Performance (AUCs):
    - Stone: 0.83
    - Atrophy: 0.92
    - Lesion: 0.68
    - Cyst: 0.70
    - No Apparent Disease: 0.79
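To make the output layout concrete, the sketch below pairs a five-element score vector with the label names listed above and applies a decision threshold per label. The dictionary `ORGAN_LABELS`, the function `decode_scores`, the label order, and the 0.5 threshold are assumptions for illustration. The actual output order and any recommended operating points should be taken from the repositories.

```python
# Label names as listed in this document; the ORDER is an assumption.
ORGAN_LABELS = {
    "lungs_pleura": [
        "Atelectasis", "Nodule", "Emphysema", "Effusion", "No Apparent Disease",
    ],
    "liver_gallbladder": [
        "Hepatobiliary Calcification", "Lesion", "Dilation", "Fatty Liver",
        "No Apparent Disease",
    ],
    "kidneys_ureters": [
        "Stone", "Atrophy", "Lesion", "Cyst", "No Apparent Disease",
    ],
}


def decode_scores(organ, scores, threshold=0.5):
    """Map a per-label score vector to named scores and binary decisions."""
    labels = ORGAN_LABELS[organ]
    if len(scores) != len(labels):
        raise ValueError(
            f"expected {len(labels)} scores for {organ}, got {len(scores)}"
        )
    # Each label is thresholded independently (multi-label, not softmax argmax).
    return {
        label: {"score": float(score), "positive": float(score) >= threshold}
        for label, score in zip(labels, scores)
    }
```

Because the task is multi-label, several labels can be positive for the same scan, so each score is thresholded on its own rather than taking the single highest-scoring class.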
The models were trained on CT data from over 13,000 scans and evaluated on a subset of 2,158 volumes with 2,875 manually validated reference labels. Automated label extraction achieved 91% to 99% accuracy during internal validation.
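For readers who want to reproduce per-label AUCs on their own labeled data, the sketch below computes AUC in its rank-based form: the probability that a randomly chosen positive case scores higher than a randomly chosen negative case, with ties counted as one half. The function names `binary_auc` and `per_label_auc` are hypothetical, and this is not the evaluation script from the repositories. The pairwise loop is quadratic in the number of cases, so a library routine is preferable for large test sets.

```python
def binary_auc(y_true, y_score):
    """AUC as P(score of positive > score of negative), ties counted as 0.5."""
    pos = [s for t, s in zip(y_true, y_score) if t == 1]
    neg = [s for t, s in zip(y_true, y_score) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC is undefined without both positive and negative cases")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def per_label_auc(y_true_rows, y_score_rows, label_names):
    """Compute one AUC per label column of a multi-label evaluation set."""
    result = {}
    for j, name in enumerate(label_names):
        truth = [row[j] for row in y_true_rows]
        scores = [row[j] for row in y_score_rows]
        result[name] = binary_auc(truth, scores)
    return result
```

Each label is scored independently, which matches how the per-disease AUCs above are reported.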
Implementation Details
- Programming Language: Python
- Framework: TensorFlow
- Model Type: 3D Convolutional Neural Network (CNN)
- Preprocessing: Organ segmentation (via DenseVNet), intensity normalization, and cropping of CT volumes to organ-specific regions of interest.
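The intensity normalization and cropping steps can be sketched with NumPy as follows, assuming an organ mask has already been produced by a segmentation model. The Hounsfield-unit window, the voxel margin, and the function names `normalize_hu` and `crop_to_mask` are illustrative assumptions. The values actually used for these models are documented in the repositories.

```python
import numpy as np


def normalize_hu(volume, hu_min=-1000.0, hu_max=800.0):
    """Clip to an assumed HU window and rescale linearly to [0, 1]."""
    clipped = np.clip(volume.astype(np.float32), hu_min, hu_max)
    return (clipped - hu_min) / (hu_max - hu_min)


def crop_to_mask(volume, mask, margin=2):
    """Crop a 3D volume to the bounding box of a binary organ mask.

    The box is padded by `margin` voxels on each side and clamped to the
    volume bounds.
    """
    coords = np.argwhere(mask > 0)
    if coords.size == 0:
        raise ValueError("organ mask is empty")
    lo = np.maximum(coords.min(axis=0) - margin, 0)
    hi = np.minimum(coords.max(axis=0) + 1 + margin, volume.shape)
    return volume[lo[0]:hi[0], lo[1]:hi[1], lo[2]:hi[2]]
```

A typical call would be `roi = crop_to_mask(normalize_hu(ct_volume), organ_mask)`, where `ct_volume` and `organ_mask` are same-shaped 3D arrays supplied by the caller.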
Repository Links
The source code, model weights, and usage instructions will be made publicly available through:
- GitHub Repository: https://github.com/fitushar/multi-label-weakly-supervised-classification-of-body-ct
- GitLab Repository: https://gitlab.oit.duke.edu/railabs/LoGroup/multi-label-weakly-supervised-classification-of-body-ct
These repositories include:
- Model loading and inference scripts
- Preprocessing pipeline details
- Instructions for applying the model to new CT data
- Evaluation tools and AUC reporting scripts
License and Citation
These models are released for academic research purposes only. If you use them in your work, please cite the original study. Citation details will be provided in the repository README.
Files
Total size: 43.7 MB
| Checksum | Size |
|---|---|
| md5:8c6f7a944e2db528b8292432bf6ec898 | 14.6 MB |
| md5:8d06fb43843551142520a1f10325afa2 | 14.6 MB |
| md5:3f9e652cb7fe62f01d33da4222f5d9fc | 14.6 MB |
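A downloaded weight file can be checked against the MD5 checksums listed above with the standard library. The sketch below is one way to do that. The function names `md5_of_file` and `verify_download` are hypothetical, and the file path is whatever name the downloaded file has on disk.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_md5):
    """Return True if the file's MD5 matches; accepts an optional 'md5:' prefix."""
    expected = expected_md5.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[4:]
    return md5_of_file(path) == expected
```

A mismatch usually indicates an incomplete or corrupted download, in which case the file should be fetched again before loading it.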
Additional details
Dates
- Available: 2025-03-04
References
- Tushar, Fakrul Islam, et al. "Classification of multiple diseases on body CT scans using weakly supervised deep learning." Radiology: Artificial Intelligence 4.1 (2021): e210026.