Published August 28, 2025 | Version v1
Dataset Open

Lung-PET-CT-Dx-Annotations: Expert annotation of lung tumors for the Lung-PET-CT-Dx collection

  • 1. ROR icon Brigham and Women's Hospital
  • 2. Brigham and Women's Hospital Department of Radiology

Description

This collection contains expert annotations of bounding boxes identifying locations of the tumor in the Lung-PET-CT-Dx collection [1] in the individual slices of the images. The annotations are accompanied by the assignment of tumor type.

The annotations were originally shared as XML files attached to the Lung-PET-CT-Dx collection. This dataset contains the earlier shared annotations harmonized into DICOM Structured Report representation. This dataset is available from the NCI Imaging Data Commons (IDC), and can be explored interactively in the IDC Portal using this link:  https://portal.imaging.datacommons.cancer.gov/explore/filters/?analysis_results_id=Lung-PET-CT-Dx-Annotations

The location of each tumor was annotated by five academic thoracic radiologists with expertise in lung cancer to make this dataset a useful tool and resource for developing algorithms for medical diagnosis.  Two of the radiologists had more than 15 years of experience and the others had more than 5 years of experience. After one of the radiologists labeled each subject the other four radiologists performed a verification, resulting in all five radiologists reviewing each annotation file in the dataset. Annotations were originally captured using Labellmg

Specific files included in the record can be downloaded using the attached manifests. The suffix of the manifest indicates its content, which is the list of pointers to the public Google Cloud Storage (GCS) or Amazon Web Services (AWS) buckets containing the files included in the collection:

  1. -gcs.s5cmd: GCS-based manifest (to download the files described in the manifest, execute this command: pip install --upgrade idc-index && idc download manifest).

  2. -aws.s5cmd: AWS-based manifest (to download the files described in the manifest, execute this command: pip install --upgrade idc-index && idc download manifest).

  3. -dcf.dcf: Gen3-based manifest (see details in https://learn.canceridc.dev/data/organization-of-data/guids-and-uuids). 

[1] Li, P., Wang, S., Li, T., Lu, J., HuangFu, Y., & Wang, D. (2020). A Large-Scale CT and PET/CT Dataset for Lung Cancer Diagnosis (Lung-PET-CT-Dx) [Data set]. The Cancer Imaging Archive. https://doi.org/10.7937/TCIA.2020.NNC2-0461

Files

Files (189.9 kB)

Name Size Download all
md5:510b5c71c8177f4c3418db43d6088d3a
70.2 kB Download
md5:34fa4374876b3126089a083f34074090
49.5 kB Download
md5:b9dc2d463332f746f3146a5dfacf461a
70.2 kB Download

Additional details

Related works

Is derived from
Dataset: 10.7937/TCIA.2020.NNC2-0461 (DOI)
Is published in
Other: 10.25504/FAIRsharing.0b5a1d (DOI)