CURVAS: Calibration and Uncertainty for multiRater Volume Assessment in multiorgan Segmentation

Riera i Marín, Meritxell; García López, Javier; Kleiss, Joy; O K, Shika; Galdrán, Adrian; May, Matthias; González-Ballester, Miguel Angel; Rodríguez Comas, Júlia; Schmidt, Maximilan; Hessman, Christopher; Aubanell, Antón; Antolín, Andreu

doi:10.5281/zenodo.10979642

Published April 16, 2024 | Version v1

Other Open

CURVAS: Calibration and Uncertainty for multiRater Volume Assessment in multiorgan Segmentation

1. Sycai Medical / Universitat Pompeu Fabra (UPF)
2. Sycai Medical
3. Universitätsklinikum Erlangen
4. Universitat Pompeu Fabra (UPF)
5. Hospital de Sant Pau
6. Hospital Vall d'Hebron

In medical imaging, DL models are often tasked with delineating structures or abnormalities within complex anatomical structures, such as tumors, blood vessels, or organs. Uncertainty arises from the inherent complexity and variability of these structures, leading to challenges in precisely defining their boundaries. This uncertainty is further compounded by interrater variability, as different medical experts may have varying opinions on where the true boundaries lie. DL models must grapple with these discrepancies, leading to inconsistencies in segmentation results across different annotators and potentially impacting diagnosis and treatment decisions. Addressing interrater variability in DL for medical segmentation involves the development of robust algorithms capable of capturing and quantifying uncertainty, as well as standardizing annotation practices and promoting collaboration among medical experts to reduce variability and improve the reliability of DL-based medical image analysis. Interrater variability poses significant challenges in the field of DL for medical image segmentation.

Furthermore, achieving model calibration, a fundamental aspect of reliable predictions, becomes notably challenging when dealing with multiple classes and raters. Calibration is pivotal for ensuring that predicted probabilities align with the true likelihood of events, enhancing the model's reliability. It must be considered that, even if not clearly, having multiple classes account for uncertainties arising from their interactions. Morevoer, incorporating annotations from multiple raters adds another layer of complexity, as differing expert opinions may contribute to a broader spectrum of variability and computational complexity.

Consequently, the development of robust algorithms capable of effectively capture and quantify variability and uncertainty, while also accommodating the nuances of multi-class and multi-rater scenarios, becomes imperative. Striking a balance between model calibration, accurate segmentation and handling variability in medical annotations is crucial for the success and reliability of DL-based medical image analysis.

The data used for this challenge consists of abdominal CTs in which pancreas, liver and kidneys are labeled. Each CT will be labeled by three radiologists.

Files

Calibration and Uncertainty for multiRater Volume.pdf

Files (113.1 kB)

Name	Size	Download all
Calibration and Uncertainty for multiRater Volume.pdf md5:30853e730752e875e0f12b10d580e7b0	113.1 kB	Preview Download

	All versions	This version
Views	565	565
Downloads	408	408
Data volume	49.6 MB	49.6 MB

CURVAS: Calibration and Uncertainty for multiRater Volume Assessment in multiorgan Segmentation

Authors/Creators

Description

Files

Calibration and Uncertainty for multiRater Volume.pdf

Files (113.1 kB)