DICOM converted Slide Microscopy images for the TCGA-SKCM collection
Creators
- 1. PixelMed Publishing
- 2. Institute for Systems Biology
- 3. General Dynamics IT
- 4. Frederick National Laboratory
- 5. National Cancer Institute
- 6. Brigham and Women's Hospital
Description
This dataset corresponds to a collection of images and/or image-derived data available from National Cancer Institute Imaging Data Commons (IDC) [1]. This dataset was converted into DICOM representation and ingested by the IDC team. You can explore and visualize the corresponding images using IDC Portal here: TCGA-SKCM. You can use the manifests included in this Zenodo record to download the content of the collection following the Download instructions below.
Collection description
Melanoma is a cancer in a type of skin cells called melanocytes. Melanocyes are the cells that produce melanin, which colors the skin. When exposed to sun, these cells make more melanin, causing the skin to darken or tan. Melanoma can occur anywhere on the body and risk factors include fair complexion, family history of melanoma, and being exposed to natural or artificial sunlight over long periods of time. Melanoma is most often discovered because it has metastasized, or spread, to another organ, such as the lymph nodes. In many cases, the primary skin melanoma site is never found. Because of this challenge, TCGA is studying primarily metastatic cases (in contrast to other cancers selected for study, where metastatic cases are excluded). For 2011, it was estimated that there were 70,230 new cases of melanoma and 8,790 deaths from the disease.
Please see the TCGA-SKCM information page to learn more about the images and to obtain any supporting metadata for this collection.
Citation guidelines can be found on the Citing TCGA in Publications and Presentations information page.
Files included
A manifest file's name indicates the IDC data release in which a version of collection data was first introduced.
For example, collection_id-idc_v8-aws.s5cmd
corresponds to the contents of the
collection_id
collection introduced in IDC data
release v8. If there is a subsequent version of this Zenodo page, it will indicate when a subsequent version of
the corresponding collection was introduced.
tcga_skcm-idc_v10-aws.s5cmd
: manifest of files available for download from public IDC Amazon Web Services bucketstcga_skcm-idc_v10-gcs.s5cmd
: manifest of files available for download from public IDC Google Cloud Storage bucketstcga_skcm-idc_v10-dcf.dcf
: Gen3 manifest (for details see https://learn.canceridc.dev/data/organization-of-data/guids-and-uuids)
Note that manifest files that end in -aws.s5cmd
reference files stored in Amazon Web Services (AWS) buckets, while -gcs.s5cmd
reference
files in Google Cloud Storage. The actual files are identical and are mirrored between AWS and GCP.
Download instructions
Each of the manifests include instructions in the header on how to download the included files.
To download the files using .s5cmd
manifests:
- install idc-index package:
pip install --upgrade idc-index
- download the files referenced by manifests included in this dataset by passing the
.s5cmd
manifest file:idc download manifest.s5cmd
.
To download the files using .dcf
manifest, see manifest header.
Acknowledgments
Imaging Data Commons team has been funded in whole or in part with Federal funds from the National Cancer Institute, National Institutes of Health, under Task Order No. HHSN26110071 under Contract No. HHSN261201500003l.
References
[1] Fedorov, A., Longabaugh, W. J. R., Pot, D., Clunie, D. A., Pieper, S. D., Gibbs, D. L., Bridge, C., Herrmann, M. D., Homeyer, A., Lewis, R., Aerts, H. J. W., Krishnaswamy, D., Thiriveedhi, V. K., Ciausu, C., Schacherer, D. P., Bontempi, D., Pihl, T., Wagner, U., Farahani, K., Kim, E. & Kikinis, R. National Cancer Institute Imaging Data Commons: Toward Transparency, Reproducibility, and Scalability in Imaging Artificial Intelligence. RadioGraphics (2023). https://doi.org/10.1148/rg.230180
Files
Files
(337.3 kB)
Name | Size | Download all |
---|---|---|
md5:0818d108a425c32e09f6926e0a7b7aa6
|
61.2 kB | Download |
md5:64ddf1e00fe99468545f8f79e78dc202
|
209.3 kB | Download |
md5:92c117aa6fc3bc5cb92ec8fc352ed6de
|
66.9 kB | Download |
Additional details
Related works
- Cites
- 10.1148/rg.230180 (DOI)
- Is published in
- 10.25504/FAIRsharing.0b5a1d (DOI)