Published April 6, 2025 | Version 1.0
Model Open

Classification of Multiple Diseases on Body CT Scans Using Weakly Supervised Deep Learning- Model weights

  • 1. ROR icon Duke University

Contributors

  • 1. ROR icon Duke University

Description

 

Model Documentation: Multidisease Classification Models for Body CT Scans

This document provides an overview and usage guidance for three deep learning models developed to perform multidisease classification on body CT scans. The models are based on 3D convolutional neural networks implemented in Python using TensorFlow, and they were trained using weak supervision derived from radiology report text.

Background and Purpose

These models were developed as part of a retrospective study aiming to detect multiple common disease conditions across three major organ systems—lungs and pleura, liver and gallbladder, and kidneys and ureters—using body CT scans. Labels for training were extracted using rule-based natural language processing (NLP) from radiology reports, enabling efficient training without extensive manual annotation.

The work demonstrates how weak supervision can support the development of clinically relevant, multi-organ disease classifiers on a large scale.

Model Summary

Each model targets a specific organ system and predicts the presence or absence of five disease categories (four pathologies + one "no apparent disease" class):

  1. Lungs and Pleura: 

    • Labels: Atelectasis, Nodule, Emphysema, Effusion, No Apparent Disease

    • Performance (AUCs):

      • Atelectasis: 0.77

      • Nodule: 0.65

      • Emphysema: 0.89

      • Effusion: 0.97

      • No Apparent Disease: 0.89

  2. Liver and Gallbladder

    • Labels: Hepatobiliary Calcification, Lesion, Dilation, Fatty Liver, No Apparent Disease

    • Performance (AUCs):

      • Calcification: 0.62

      • Lesion: 0.73

      • Dilation: 0.87

      • Fatty: 0.89

      • No Apparent Disease: 0.82

  3. Kidneys and Ureters

    • Labels: Stone, Atrophy, Lesion, Cyst, No Apparent Disease

    • Performance (AUCs):

      • Stone: 0.83

      • Atrophy: 0.92

      • Lesion: 0.68

      • Cyst: 0.70

      • No Apparent Disease: 0.79

The models were trained on CT data from over 13,000 scans and evaluated on a subset of 2,158 volumes with 2,875 manually validated reference labels. Automated label extraction achieved between 91%–99% accuracy during internal validation.

Implementation Details

  • Programming Language: Python

  • Framework: TensorFlow

  • Model Type: 3D Convolutional Neural Network (CNN)

  • Preprocessing: Organ segmentation (via DenseVNet), intensity normalization, and cropping of CT volumes to organ-specific regions of interest.

Repository Links

The source code, model weights, and usage instructions will be made publicly available through:

  • GitHub Repository: https://github.com/fitushar/multi-label-weakly-supervised-classification-of-body-ct

  • GitLab Repository: https://gitlab.oit.duke.edu/railabs/LoGroup/multi-label-weakly-supervised-classification-of-body-ct

These repositories include:

  • Model loading and inference scripts

  • Preprocessing pipeline details

  • Instructions for applying the model to new CT data

  • Evaluation tools and AUC reporting scripts

License and Citation

These models are released for academic research purposes only. If you use them in your work, please cite the original study. Citation details will be provided in the repository README.

Files

Files (43.7 MB)

Name Size Download all
md5:8c6f7a944e2db528b8292432bf6ec898
14.6 MB Download
md5:8d06fb43843551142520a1f10325afa2
14.6 MB Download
md5:3f9e652cb7fe62f01d33da4222f5d9fc
14.6 MB Download

Additional details

Dates

Available
2025-03-04

References

  • Tushar, Fakrul Islam, et al. "Classification of multiple diseases on body CT scans using weakly supervised deep learning." Radiology: Artificial Intelligence 4.1 (2021): e210026.