Self-Supervision for Medical Image Classification: State-of-the-Art Performance with ~100 Labeled Training Samples per Class Models
Authors/Creators
Description
Pretrained Models associated with: Self-Supervision for Medical Image Classification: State-of-the-Art Performance with ~100 Labeled Training Samples per Class
Is self-supervised deep learning (DL) for medical image analysis already a serious alternative to the de facto standard of end-to-end trained supervised DL? We tackle this question for medical image classification, with a particular focus on one of the currently most limiting factor of the field: the (non-)availability of labeled data. Based on three common medical imaging modalities (bone marrow microscopy, gastrointestinal endoscopy, dermoscopy) and publicly available data sets, we analyze the performance of self-supervised DL within the self-distillation with no labels (DINO) framework. After learning an image representation without use of image labels, conventional machine learning classifiers are applied. The classifiers are fit using a systematically varied number of labeled data (1–1000 samples per class). Exploiting the learned image representation, we achieve state-of-the-art classification performance for all three imaging modalities and data sets with only a fraction of between 1% and 10% of the available labeled data and about 100 labeled samples per class.
Files
Files
(578.7 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:4afabb89619545ec1e7a72c14afd1c9a
|
192.9 MB | Download |
|
md5:d4d0142721165faf06aea12fe90d0111
|
192.9 MB | Download |
|
md5:826eea56c438de7a898819cf943df263
|
192.9 MB | Download |
Additional details
References
- Nielsen, Maximilian, Laura Wenderoth, Thilo Sentker, and René Werner. 2023. "Self-Supervision for Medical Image Classification: State-of-the-Art Performance with ~100 Labeled Training Samples per Class" Bioengineering 10, no. 8: 895. https://doi.org/10.3390/bioengineering10080895