Published 2026 | Version v2
Dataset Open

MAGIC-CT: Multiorgan Annotation and Grounded Image Captioning in CT for Cancer

Description

We present MAGIC-CT, a contrast-enhanced CT dataset of 562 patients with abdominal tumors (liver cysts/cancer, lung metastases, lung cancer, kidney cysts, renal cancer, pancreatic cancer). All 562 patients have CT scans and 3D lesion/organ masks; a subset of 492 patients also have organ-wise, radiologist-authored reports (RU/KZ/EN) totaling 4,937 organ descriptions. The dataset spans 8 pathologies across 4 organs, with about 1,250 annotated lesions and about 500 lesion-linked textual findings, enabling training of multimodal systems that connect volumetric localization to clinical language. MAGIC-CT uniquely integrates volumetric lesion localization, quantitative metrics (e.g., tumor volume, angular involvement of vasculature), and rich semantic context (e.g., "cuff-like encasement of the celiac trunk"), addressing the lack of resources bridging radiological imaging, segmentation, and clinical language. This dataset is expected to enable advancements in AI-driven tumor characterization, automated report generation, and metastasis tracking, with implications for precision oncology.
 
Data Use Agreement

If I am granted access to the database:

I will not attempt to identify any individual or institution referenced in MagicCT data.
I will exercise all reasonable and prudent care to avoid disclosure of the identity of any individual or institution referenced in MagicCT restricted data in any publication or other communication.
I will not share access to MagicCT restricted data with anyone else.
I will exercise all reasonable and prudent care to maintain the physical and electronic security of MagicCT restricted data.
If I find information within MagicCT restricted data that I believe might permit identification of any individual or institution, I will report the location of this information promptly by email to the corresponding author, citing the location of the specific information in question.
I have requested access to MagicCT restricted data for the sole purpose of lawful use in scientific research, and I will use my privilege of access, if it is granted, for this purpose and no other.
I have completed a training program in human research subject protections and HIPAA regulations, and I am submitting proof of having done so.
I will indicate the general purpose for which I intend to use the database in my application.
If I openly disseminate my results, I will also contribute the code used to produce those results to a repository that is open to the research community.
This agreement may be terminated by either party at any time, but my obligations with respect to MagicCT data shall continue after termination.

License: CC0-1.0. Contact: maxim.popov@nu.edu.kz

Files

hcc.json

Files (39.1 GB)

Name Size Download all
md5:0bc86330a5bcf3367ee8b139e96a8abf
420.3 kB Preview Download
md5:70912568e02ce26f306a51b020c312bf
3.8 GB Preview Download
md5:2f5aac68429e04ae011561b3759c0fc7
1.7 MB Preview Download
md5:e236a9d1096304cdfbc3c12f098cd59b
2.6 GB Preview Download
md5:105c0198e762da7d16b57e22f072230f
3.1 GB Preview Download
md5:97d5380e805d8bc323ab4cd1895e243d
3.1 GB Preview Download
md5:4af90db47deac5016e49841ec041b0a6
5.4 GB Preview Download
md5:bc0806e854626b1ef8c04885ce5e13c3
2.5 GB Preview Download
md5:6e5f3e5bb8b866bc6dcddfb264790937
862.2 kB Preview Download
md5:a577e9095ab601b3706a89d6e5a0e2b5
742.3 kB Preview Download
md5:0998c122daddd4fddcacce1e239edb46
8.4 GB Preview Download
md5:31d6c8744cbfbb0fb743e3f2b40d7a09
51.3 kB Preview Download
md5:ca8baae2ef93184b87df2ed66c8c3395
762.2 MB Preview Download
md5:98c269ca13ee68e719f3eea8318a122c
338.7 kB Preview Download
md5:61e147bfff5f9e7d2cad8be5c6be3a7a
3.4 GB Preview Download
md5:c9aff9bcee90d5e4689439bb44b8e7d8
306.1 kB Preview Download
md5:11dfe710d22a21110fef3c74b94c611c
2.4 GB Preview Download
md5:63a65ae6afe7241c3c234fa2ff1f3787
235.9 kB Preview Download
md5:18182fe09ffbab04779283b715028a41
1.7 GB Preview Download
md5:d75322905edc8e7bdf6da814142256b4
252.2 kB Preview Download
md5:3ad35735b123a2527550f471c99ae953
1.9 GB Preview Download

Additional details

Funding

Nazarbayev University
Collaborative Research Program of Nazarbayev University 111024CRP2007

Software

Repository URL
https://github.com/maxtrubetskoy/MagicCT
Programming language
Python
Development Status
Active