There is a newer version of the record available.

Published April 27, 2022 | Version 2.1
Dataset Open

MedMNIST v2: A Large-Scale Lightweight Benchmark for 2D and 3D Biomedical Image Classification

  • 1. ROR icon Shanghai Jiao Tong University
  • 2. ROR icon Harvard University
  • 3. ROR icon Boston College
  • 4. ROR icon RWTH Aachen University
  • 5. Zhongshan Hospital Affiliated to Fudan University
  • 6. Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine

Description

Abstract

We introduce MedMNIST v2, a large-scale MNIST-like collection of standardized biomedical images, including 12 datasets for 2D and 6 datasets for 3D. All images are pre-processed into 28x28 (2D) or 28x28x28 (3D) with the corresponding classification labels, so that no background knowledge is required for users. Covering primary data modalities in biomedical images, MedMNIST v2 is designed to perform classification on lightweight 2D and 3D images with various data scales (from 100 to 100,000) and diverse tasks (binary/multi-class, ordinal regression and multi-label). The resulting dataset, consisting of 708,069 2D images and 10,214 3D images in total, could support numerous research / educational purposes in biomedical image analysis, computer vision and machine learning. We benchmark several baseline methods on MedMNIST v2, including 2D / 3D neural networks and open-source / commercial AutoML tools. The data and code are publicly available at https://medmnist.com/.

Note: This dataset is NOT intended for clinical use.

We recommend our official code to download, parse and use the MedMNIST dataset:

pip install medmnist

 

Citation

If you find this project useful, please cite both v1 and v2 paper as:

Jiancheng Yang, Rui Shi, Donglai Wei, Zequan Liu, Lin Zhao, Bilian Ke, Hanspeter Pfister, Bingbing Ni. Yang, Jiancheng, et al. "MedMNIST v2-A large-scale lightweight benchmark for 2D and 3D biomedical image classification." Scientific Data, 2023.

Jiancheng Yang, Rui Shi, Bingbing Ni. "MedMNIST Classification Decathlon: A Lightweight AutoML Benchmark for Medical Image Analysis". IEEE 18th International Symposium on Biomedical Imaging (ISBI), 2021.

or using bibtex:

@article{medmnistv2,
    title={MedMNIST v2-A large-scale lightweight benchmark for 2D and 3D biomedical image classification},
    author={Yang, Jiancheng and Shi, Rui and Wei, Donglai and Liu, Zequan and Zhao, Lin and Ke, Bilian and Pfister, Hanspeter and Ni, Bingbing},
    journal={Scientific Data},
    volume={10},
    number={1},
    pages={41},
    year={2023},
    publisher={Nature Publishing Group UK London}
}

@inproceedings{medmnistv1,
    title={MedMNIST Classification Decathlon: A Lightweight AutoML Benchmark for Medical Image Analysis},
    author={Yang, Jiancheng and Shi, Rui and Ni, Bingbing},
    booktitle={IEEE 18th International Symposium on Biomedical Imaging (ISBI)},
    pages={191--195},
    year={2021}
}

Please also cite the corresponding paper(s) of source data if you use any subset of MedMNIST as per the description on the project website.

 

License

The MedMNIST dataset is licensed under Creative Commons Attribution 4.0 International (CC BY 4.0), except DermaMNIST under Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0).

The code is under Apache-2.0 License.

 

Changelog

v2.1 (this repository): We have fixed the mistake in the file of NoduleMNIST3D (i.e., nodulemnist3d.npz). More details in this issue.

v2.0: Initial repository of MedMNIST v2, add 6 datasets for 3D and 2 for 2D. 

v1.0: Initial repository of MedMNIST v1, 10 datasets for 2D.

Files

Files (705.8 MB)

Name Size Download all
md5:bbd3c5a5576322bc4cdfea780653b1ce
276.8 kB Download
md5:7053d0359d879ad8a5505303e11de1dc
35.5 MB Download
md5:750601b1f35ba3300ea97c75c52ff8f6
559.6 kB Download
md5:02c8a6516a18b556561a56cbdd36c4a8
82.8 MB Download
md5:0744692d530f8e62ec473284d019b0c7
19.7 MB Download
md5:6aa7b0143a6b42da40027a9dda61302f
3.3 MB Download
md5:8755a7e9e05a4d9ce80a24c3e7a256f3
29.3 MB Download
md5:c68d92d5b585d8d81f7112f81e2d0842
54.9 MB Download
md5:866b832ed4eeba67bfb9edee1d5544e6
38.2 MB Download
md5:0afa5834fb105f7705a7d93372119a21
15.5 MB Download
md5:21f0a239e7f502e6eca33c3fc453c0b6
32.7 MB Download
md5:e5c39f1af030238290b9557d9503af9d
16.5 MB Download
md5:a8b06965200029087d5bd730944a56c1
205.6 MB Download
md5:28209eda62fecd6e6a2d98b1501bb15f
4.2 MB Download
md5:bd4c0672f1bba3e3a89f0e4e876791e4
3.3 MB Download
md5:1235b78a3cd6280881dd7850a78eadb6
38.0 MB Download
md5:ebe78ee8b05294063de985d821c1c34b
125.0 MB Download
md5:2ba5b80617d705141f3f85627108fce8
398.4 kB Download