NACHOS dataset: OCT and Xray
Authors/Creators
Description
The following datasets are part of a future paper.
Kidney OCT dataset
Three type of tissues were sampled: cortex, medulla, and pelvis. Image size: 185*210 pixels. The OCT dataset comes from 10 porcine kidneys. For each kidney and tissue, there are 30 volumes. Each volume contain 20 images. Total number of images: 10*3*30*20 = 18,000. The same dataset was partitioned in three different levels: image (folder: split_random), volumen (folder: split_volume), and subject (folder: split_subject).
Chest X-ray repository
A chest X-ray repository was built using the ChestX-ray8 dataset, the CheXpert dataset, the MIMIC-CXR dataset, and the PadChest dataset from the TorchXRayVision library. A chest X-ray repository was built using the ChestX-ray8 dataset, the CheXpert dataset, the MIMIC-CXR dataset, and the PadChest dataset from the TorchXRayVision library. The chest X-ray repository was partitioned into four folds using three different partitioning levels. In image-level partitioning(folder: split1_random), images were randomly distributed across four folds. In patient-level partitioning (folder: split2_patient), all images from the same patient were assigned to the same fold. Finally, in dataset-level partitioning (folder: split3_dataset), each dataset was exclusively allocated to a separate fold.
Files
OCT_data.zip
Files
(2.2 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:ff62c5d875c888f507c409c534144f89
|
1.8 GB | Preview Download |
|
md5:f4b544d8b5b0d82af9dd5d7b4b3d5c1b
|
373.6 MB | Preview Download |
Additional details
Software
- Repository URL
- https://github.com/thepanlab/NACHOS
- Programming language
- Python