Published April 10, 2024 | Version v1
Dataset Open

MedIMeta: A comprehensive and easy-to-use multi-domain multi-task medical imaging meta-dataset

Description

We introduce the Medical Imaging Meta-Dataset (MedIMeta), a novel multi-domain, multi-task meta-dataset designed to facilitate the development and standardised evaluation of ML models and cross-domain few-shot learning algorithms for medical image classification. MedIMeta contains 19 medical imaging datasets spanning 10 different domains and encompassing 54 distinct medical tasks, offering opportunities for both single-task and multi-task training. All tasks are standardised to the same format and readily usable in PyTorch or other ML frameworks. All datasets have been previously published with an open license that allows redistribution or we obtained an explicit permission to do so.

Each dataset within the MedIMeta dataset is standardized to a size of 224 × 224 pixels which matches image size commonly used in pre-trained models. Furthermore, the dataset comes with pre-made splits to ensure ease of use and standardized benchmarking. We release a user-friendly Python package to directly load images for use in PyTorch.

Links

 

Dataset Overview

Dataset Name Dataset ID License Domain Task Names Task Targets # Labels
AML Cytomorphology aml CC BY-SA 4.0 Microscopy morphological class multi-class classification 15
Breast Ultrasound bus CC BY-SA 4.0 Breast ultrasound case category
malignancy
multi-class classification
binary classification
3
2
Colorectal Cancer Histopathology crc CC BY-SA 4.0 Histopathology tissue class multi-class classification 9
Chest X-ray Multi-disease cxr CC BY-SA 4.0 Chest X-ray disease labels
patient sex
multi-label classification
binary classification
14
2
Dermatoscopy derm CC BY-SA 4.0 Dermatoscopy disease category multi-class classification 7
Diabetic Retinopathy (Regular Fundus) dr_regular CC BY-SA 4.0 Retinal fundus DR level
Overall quality
Artifact
Clarity
Field definition
ordinal regression
binary classification
ordinal regression
ordinal regression
ordinal regression
5
2
6
5
5
Diabetic Retinopathy (Ultra-widefield Fundus) dr_uwf CC BY-SA 4.0 Retinal fundus DR level ordinal regression 5
Fundus Multi-disease fundus CC BY-SA 4.0 Retinal fundus disease presence
disease labels
binary classification
multi-label classification
2
45
Glaucoma-specific fundus images glaucoma CC BY-SA 4.0 Retinal fundus Glaucoma suspect binary classification 2
Mammography (Calcifications) mammo_calc CC BY-SA 4.0 Mammography pathology
calc type
calc distribution
binary classification
multi-label classification
multi-label classification
2
14
5
Mammography (Masses) mammo_mass CC BY-SA 4.0 Mammography pathology
mass shape
mass margins
binary classification
multi-label classification
multi-label classification
2
8
5
OCT oct CC BY-SA 4.0 OCT disease class
urgent referral
multi-class classification
binary classification
4
2
Axial Organ Slices organs_axial CC BY-NC-SA 4.0 Abdominal CT organ label multi-class classification 11
Coronal Organ Slices organs_coronal CC BY-NC-SA 4.0 Abdominal CT organ label multi-class classification 11
Sagittal Organ Slices organs_sagittal CC BY-NC-SA 4.0 Abdominal CT organ label multi-class classification 11
Peripheral Blood Cells pbc CC BY-SA 4.0 Microscopy cell class multi-class classification 8
Pediatric Pneumonia pneumonia CC BY-SA 4.0 Chest X-ray pneumonia presence
disease class
binary classification
multi-class classification
2
3
Skin Lesion Evaluation (Dermoscopy) skinl_derm CC BY-SA 4.0 Dermatoscopy Diagnosis
Diagnosis grouped
Pigment Network
Blue Whitish Veil
Vascular Structures
Vascular Structures grouped
Pigmentation
Pigmentation grouped
Streaks
Dots and Globules
Regression Structures
Regression Structures grouped
multi-class classification
multi-class classification
multi-class classification
binary classification
multi-class classification
multi-class classification
multi-class classification
multi-class classification
multi-class classification
multi-class classification
multi-class classification
binary classification
15
5
3
2
8
3
5
3
3
3
4
2
Skin Lesion Evaluation (Clinical Photography) skinl_photo CC BY-SA 4.0 Clinical skin imaging Diagnosis
Diagnosis grouped
Pigment Network
Blue Whitish Veil
Vascular Structures
Vascular Structures grouped
Pigmentation
Pigmentation grouped
Streaks
Dots and Globules
Regression Structures
Regression Structures grouped
multi-class classification
multi-class classification
multi-class classification
binary classification
multi-class classification
multi-class classification
multi-class classification
multi-class classification
multi-class classification
multi-class classification
multi-class classification
binary classification
15
5
3
2
8
3
5
3
3
3
4
2

Files

aml.zip

Files (52.3 GB)

Name Size Download all
md5:8fcaa86086c12bf324c3acc26d3e1ec8
4.2 GB Preview Download
md5:15e253aa4ad2a979529c440c297a8c64
54.8 MB Preview Download
md5:cb56f1bc13def7011457725cc030f419
25.2 GB Preview Download
md5:3f25d10360b4ffedf8afd5535d99809d
8.1 GB Preview Download
md5:3d27fec149dc82af5bc3b5e55d114e19
2.7 GB Preview Download
md5:c99254da25abe2e656b844c844d52fa7
328.5 MB Preview Download
md5:f26e06bfbb8d458432a852185806b64d
37.5 MB Preview Download
md5:e3ef6be180ec9583889d7ad8823e6eee
608.9 MB Preview Download
md5:a322c4762de97b510f07b94297038219
136.1 MB Preview Download
md5:d528a5703f8a22baa7c84ef3bf3122f8
109.7 MB Preview Download
md5:b69263af33bc46e9363b4370f7f1b88d
98.1 MB Preview Download
md5:969c2e5728ac03904c73e70803c02742
6.7 GB Preview Download
md5:7b4e2cdc9d67618bcf59e781c2156065
153.0 MB Preview Download
md5:bdb65abd2e4488618a7716d9246bb8a7
142.5 MB Preview Download
md5:26e6874add5a4bde6f00307f34c30f7b
131.5 MB Preview Download
md5:5136e76b284a00a065e90a301790e81d
2.9 GB Preview Download
md5:f3804c93129e6afb4233a3228da0082c
337.1 MB Preview Download
md5:5e7cdc68c444cc93d2f315131ca18df7
168.8 MB Preview Download
md5:2829b2070c1e24fe9d70795c49faba51
164.2 MB Preview Download

Additional details

Software

Programming language
Python
Development Status
Active