MedIMeta: A comprehensive and easy-to-use multi-domain multi-task medical imaging meta-dataset
Description
We introduce the Medical Imaging Meta-Dataset (MedIMeta), a novel multi-domain, multi-task meta-dataset designed to facilitate the development and standardised evaluation of ML models and cross-domain few-shot learning algorithms for medical image classification. MedIMeta contains 19 medical imaging datasets spanning 10 different domains and encompassing 54 distinct medical tasks, offering opportunities for both single-task and multi-task training. All tasks are standardised to the same format and readily usable in PyTorch or other ML frameworks. All datasets have been previously published with an open license that allows redistribution or we obtained an explicit permission to do so.
Each dataset within the MedIMeta dataset is standardized to a size of 224 × 224 pixels which matches image size commonly used in pre-trained models. Furthermore, the dataset comes with pre-made splits to ensure ease of use and standardized benchmarking. We release a user-friendly Python package to directly load images for use in PyTorch.
Links
- Project website: https://www.woerner.eu/projects/medimeta/
- Data loading code (medimeta Python package): https://github.com/StefanoWoerner/medimeta-pytorch
- Data creation code: https://github.com/StefanoWoerner/medimeta-dataset-scripts
Dataset Overview
Dataset Name | Dataset ID | License | Domain | Task Names | Task Targets | # Labels |
AML Cytomorphology | aml | CC BY-SA 4.0 | Microscopy | morphological class | multi-class classification | 15 |
Breast Ultrasound | bus | CC BY-SA 4.0 | Breast ultrasound | case category malignancy |
multi-class classification binary classification |
3 2 |
Colorectal Cancer Histopathology | crc | CC BY-SA 4.0 | Histopathology | tissue class | multi-class classification | 9 |
Chest X-ray Multi-disease | cxr | CC BY-SA 4.0 | Chest X-ray | disease labels patient sex |
multi-label classification binary classification |
14 2 |
Dermatoscopy | derm | CC BY-SA 4.0 | Dermatoscopy | disease category | multi-class classification | 7 |
Diabetic Retinopathy (Regular Fundus) | dr_regular | CC BY-SA 4.0 | Retinal fundus | DR level Overall quality Artifact Clarity Field definition |
ordinal regression binary classification ordinal regression ordinal regression ordinal regression |
5 2 6 5 5 |
Diabetic Retinopathy (Ultra-widefield Fundus) | dr_uwf | CC BY-SA 4.0 | Retinal fundus | DR level | ordinal regression | 5 |
Fundus Multi-disease | fundus | CC BY-SA 4.0 | Retinal fundus | disease presence disease labels |
binary classification multi-label classification |
2 45 |
Glaucoma-specific fundus images | glaucoma | CC BY-SA 4.0 | Retinal fundus | Glaucoma suspect | binary classification | 2 |
Mammography (Calcifications) | mammo_calc | CC BY-SA 4.0 | Mammography | pathology calc type calc distribution |
binary classification multi-label classification multi-label classification |
2 14 5 |
Mammography (Masses) | mammo_mass | CC BY-SA 4.0 | Mammography | pathology mass shape mass margins |
binary classification multi-label classification multi-label classification |
2 8 5 |
OCT | oct | CC BY-SA 4.0 | OCT | disease class urgent referral |
multi-class classification binary classification |
4 2 |
Axial Organ Slices | organs_axial | CC BY-NC-SA 4.0 | Abdominal CT | organ label | multi-class classification | 11 |
Coronal Organ Slices | organs_coronal | CC BY-NC-SA 4.0 | Abdominal CT | organ label | multi-class classification | 11 |
Sagittal Organ Slices | organs_sagittal | CC BY-NC-SA 4.0 | Abdominal CT | organ label | multi-class classification | 11 |
Peripheral Blood Cells | pbc | CC BY-SA 4.0 | Microscopy | cell class | multi-class classification | 8 |
Pediatric Pneumonia | pneumonia | CC BY-SA 4.0 | Chest X-ray | pneumonia presence disease class |
binary classification multi-class classification |
2 3 |
Skin Lesion Evaluation (Dermoscopy) | skinl_derm | CC BY-SA 4.0 | Dermatoscopy | Diagnosis Diagnosis grouped Pigment Network Blue Whitish Veil Vascular Structures Vascular Structures grouped Pigmentation Pigmentation grouped Streaks Dots and Globules Regression Structures Regression Structures grouped |
multi-class classification multi-class classification multi-class classification binary classification multi-class classification multi-class classification multi-class classification multi-class classification multi-class classification multi-class classification multi-class classification binary classification |
15 5 3 2 8 3 5 3 3 3 4 2 |
Skin Lesion Evaluation (Clinical Photography) | skinl_photo | CC BY-SA 4.0 | Clinical skin imaging | Diagnosis Diagnosis grouped Pigment Network Blue Whitish Veil Vascular Structures Vascular Structures grouped Pigmentation Pigmentation grouped Streaks Dots and Globules Regression Structures Regression Structures grouped |
multi-class classification multi-class classification multi-class classification binary classification multi-class classification multi-class classification multi-class classification multi-class classification multi-class classification multi-class classification multi-class classification binary classification |
15 5 3 2 8 3 5 3 3 3 4 2 |
Files
aml.zip
Files
(52.3 GB)
Name | Size | Download all |
---|---|---|
md5:8fcaa86086c12bf324c3acc26d3e1ec8
|
4.2 GB | Preview Download |
md5:15e253aa4ad2a979529c440c297a8c64
|
54.8 MB | Preview Download |
md5:cb56f1bc13def7011457725cc030f419
|
25.2 GB | Preview Download |
md5:3f25d10360b4ffedf8afd5535d99809d
|
8.1 GB | Preview Download |
md5:3d27fec149dc82af5bc3b5e55d114e19
|
2.7 GB | Preview Download |
md5:c99254da25abe2e656b844c844d52fa7
|
328.5 MB | Preview Download |
md5:f26e06bfbb8d458432a852185806b64d
|
37.5 MB | Preview Download |
md5:e3ef6be180ec9583889d7ad8823e6eee
|
608.9 MB | Preview Download |
md5:a322c4762de97b510f07b94297038219
|
136.1 MB | Preview Download |
md5:d528a5703f8a22baa7c84ef3bf3122f8
|
109.7 MB | Preview Download |
md5:b69263af33bc46e9363b4370f7f1b88d
|
98.1 MB | Preview Download |
md5:969c2e5728ac03904c73e70803c02742
|
6.7 GB | Preview Download |
md5:7b4e2cdc9d67618bcf59e781c2156065
|
153.0 MB | Preview Download |
md5:bdb65abd2e4488618a7716d9246bb8a7
|
142.5 MB | Preview Download |
md5:26e6874add5a4bde6f00307f34c30f7b
|
131.5 MB | Preview Download |
md5:5136e76b284a00a065e90a301790e81d
|
2.9 GB | Preview Download |
md5:f3804c93129e6afb4233a3228da0082c
|
337.1 MB | Preview Download |
md5:5e7cdc68c444cc93d2f315131ca18df7
|
168.8 MB | Preview Download |
md5:2829b2070c1e24fe9d70795c49faba51
|
164.2 MB | Preview Download |
Additional details
Software
- Programming language
- Python
- Development Status
- Active