Published January 30, 2025 | Version 1.0
Dataset Open

OCT Dataset for Segmentation of Atherosclerotic Plaque Morphological Features

  • 1. ROR icon Pompeu Fabra University
  • 2. Symfa
  • 3. ROR icon Siberian State Medical University
  • 4. Research Institute for Complex Issues of Cardiovascular Diseases

Description

Objectives: The primary goal of this dataset is to enable the automated segmentation and quantification of atherosclerotic plaque features in OCT images. Cardiovascular disease, with atherosclerosis at its core, remains a global health challenge. Accurate identification of vulnerable plaques is crucial for preventing acute cardiovascular events such as myocardial infarction and stroke. OCT imaging provides high-resolution insights into plaque morphology but is often constrained by manual interpretation challenges. This dataset, curated with diverse annotations of key plaque morphological features, aims to facilitate the development and evaluation of machine learning models for precise plaque analysis. By advancing segmentation capabilities, this dataset contributes to improved diagnostics and therapeutic strategies in cardiovascular care.

Ethical Approval: The dataset complies with ethical standards, adhering to the Declaration of Helsinki. Ethical approval was granted by the Local Ethical Committee of the Research Institute for Complex Issues of Cardiovascular Diseases (Kemerovo, Russia) under protocol code 2022/06 (approved on June 30, 2022). All participants provided informed consent. Data collection involved patients aged 18 years or older, ensuring balanced gender representation and inclusion of various comorbid conditions for comprehensive clinical relevance (refer to Table 1).

Description: The dataset consists of OCT images acquired from 103 patients across two cardiovascular research centers. These images, collected over one year, represent a diverse array of imaging devices and patient demographics. The dataset includes 25,698 annotated slices, each capturing key plaque morphological features. These features include lumen (LM), fibrous cap (FC), lipid core (LC), and vasa vasorum (VV). The images vary in dimensions from 704 x 704 to 1024 x 1024 pixels, reflecting differences in anatomical characteristics and imaging conditions. Annotations were performed using Supervisely, with meticulous double-verification processes to ensure accuracy.

Annotation Method: Two cardiologists annotated the dataset, identifying plaque features using binary masks. The annotations underwent a review and double-verification by a senior cardiologist and technical specialist, enhancing precision and consistency. The morphological features segmented include the vascular lumen, fibrous cap, lipid core, and vasa vasorum, each providing critical insights into plaque stability and cardiovascular risk.

Dataset Split: A 5-fold cross-validation technique was employed for dataset splitting, ensuring robust model evaluation while preventing data leakage. Approximately 80% of images were allocated for training in each fold, with the remaining 20% reserved for testing (refer to Table 2). This method allowed a balanced and comprehensive assessment of segmentation performance across the dataset.

Access to the Study: Further information about this study, including curated source code, dataset details, and trained models, can be accessed through the following repositories:

 

Table 1. Baseline characteristics of patients included in the study.

Parameter

Value

Sex:

 

Male, n (%)

77 (74.7)

Female, n (%)

26 (25.3)

Median Age, years [min – max]

69 [43 – 83]

Arterial hypertension, n (%)

92 (89.3)

Diabetes Mellitus, n (%)

22 (21.4)

Myocardial Infarction, n (%)

22 (21.4)

Polyvascular Disease, n (%)

29 (28.2)

Angina Pectoris:

 

Silent ischemia, n (%)

9 (8.7)

Functional class 1, n (%)

24 (23.3)

Functional class 2, n (%)

55 (53.4)

Functional class 3, n (%)

15 (14.6)

 

Table 2. Image and plaque morphological feature distributions across folds and subsets.

Fold Subset LM FC LC VV Total objects Total images
1 Train 17264 5610 5576 328 28778 16901
1 Test 4544 1616 1616 122 7898 4492
2 Train 17554 5709 5690 237 29190 17207
2 Test 4254 1517 1502 213 7486 4186
3 Train 17220 5600 5565 407 28792 16962
3 Test 4588 1626 1627 43 7884 4431
4 Train 17813 5724 5686 416 29639 17473
4 Test 3995 1502 1506 34 7037 3920
5 Train 17381 6261 6251 412 30405 17029
5 Test 4427 965 941 38 6371 4364

 

Files

Figure 1. Annotation methodology.png

Files (82.1 GB)

Name Size Download all
md5:7444d83e8adc6bce0d0f570248a44e37
1.9 MB Preview Download
md5:3bef0c0229de65a2cb618b18f97ac949
12.1 GB Preview Download
md5:8aa65b0d2008c287ca80fc4b1a219722
12.1 GB Preview Download
md5:c53045ee5dd0c3c243d95253bfd4d0ef
12.1 GB Preview Download
md5:af7dd83a88af4f9034db0d6b11f4a4ff
12.1 GB Preview Download
md5:8c998cdddb40861b788dd84986c08252
12.1 GB Preview Download
md5:3c3ae36b6c09dff946f146c149145374
21.5 GB Preview Download
md5:a0f7700924eda6c70f0f546b35125d67
26.7 MB Preview Download

Additional details

Related works

Is part of
Journal article: 10.1016/j.compbiomed.2025.111061 (DOI)

Funding

Russian Science Foundation
Development of an automated deep learning system for detecting unstable plaques in optical coherence tomography 23-75-10009

Software

Repository URL
https://github.com/ViacheslavDanilov/oct_segmentation
Programming language
Python
Development Status
Active