Published February 10, 2025 | Version v1
Dataset Open

NACHOS dataset: OCT and Xray

Description

The following datasets are part of a future paper.

Kidney OCT dataset

Three type of tissues were sampled: cortex, medulla, and pelvis. Image size: 185*210 pixels. The OCT dataset comes from 10 porcine kidneys. For each kidney and tissue, there are 30 volumes. Each volume contain 20 images. Total number of images: 10*3*30*20 = 18,000. The same dataset was partitioned in three different levels: image (folder: split_random), volumen (folder: split_volume), and subject (folder: split_subject).

Chest X-ray repository

A chest X-ray repository was built using the ChestX-ray8 dataset, the CheXpert dataset, the MIMIC-CXR dataset, and the PadChest dataset from the TorchXRayVision library. A chest X-ray repository was built using the ChestX-ray8 dataset, the CheXpert dataset, the MIMIC-CXR dataset, and the PadChest dataset from the TorchXRayVision library. The chest X-ray repository was partitioned into four folds using three different partitioning levels. In image-level partitioning(folder: split1_random), images were randomly distributed across four folds. In patient-level partitioning (folder: split2_patient), all images from the same patient were assigned to the same fold. Finally, in dataset-level partitioning (folder: split3_dataset), each dataset was exclusively allocated to a separate fold.

Files

OCT_data.zip

Files (2.2 GB)

Name Size Download all
md5:ff62c5d875c888f507c409c534144f89
1.8 GB Preview Download
md5:f4b544d8b5b0d82af9dd5d7b4b3d5c1b
373.6 MB Preview Download

Additional details

Software

Repository URL
https://github.com/thepanlab/NACHOS
Programming language
Python