Duke Lung Nodule Dataset 2024

Wang, Avivah; TUSHAR, FAKRUL ISLAM; Harowicz, Michael R.; Lafata, Kyle J.; Tailor, Tina D.; Lo, Joseph Y.

doi:10.5281/zenodo.10782891

Published March 5, 2024 | Version v1

Dataset Restricted

Duke Lung Nodule Dataset 2024

1. Duke University School of Medicine
2. Duke University
3. Duke University Health System

Background: Lung cancer risk classification is an increasingly important area of research as low-dose thoracic CT screening programs have become standard of care for patients at high risk for lung cancer. There is limited availability of large, annotated public databases for the training and testing of algorithms for lung nodule classification.

Methods: Screening chest CT scans done between January 1, 2015 and June 30, 2021 at Duke University Health System were considered for this study. Efficient nodule annotation was performed semi-automatically by using a publicly available deep learning nodule detection algorithm trained on the LUNA16 dataset to identify initial candidates, which were then accepted based on nodule location in the radiology text report or manually annotated by a medical student and a fellowship-trained cardiothoracic radiologist.

Results: The dataset contains 1613 CT volumes with 2487 annotated nodules. Radiologist spot-checking confirmed the semi-automated annotation had an accuracy rate of >90%.

Conclusions: The Duke Lung Nodule Dataset is the first large dataset for CT screening for lung cancer reflecting the use of current CT technology. This represents a useful resource of lung cancer risk classification research, and the efficient annotation methods described for its creation may be used to generate similar databases for research in the future.

Files

Restricted

The record is publicly accessible, but files are restricted to users with access.

Request access

If you would like to request access to these files, please fill out the form below.

You are currently not logged in. Do you have an account? Log in here

	All versions	This version
Views	6,242	1,160
Downloads	2,242	351
Data volume	69.6 TB	11.0 TB

Duke Lung Nodule Dataset 2024

Creators

Description

Files

Restricted

Request access