Published September 16, 2025 | Version v1
Dataset Open

SpurBreast Dataset

Authors/Creators

Description

The SpurBreast dataset is a curated real-world breast MRI resource designed to study how deep learning models exploit spurious correlations. Built on top of the DUKE Breast Cancer dataset of 3D MRI scans from over 900 patients with biopsy-confirmed invasive breast cancer, SpurBreast introduces carefully constructed training and validation splits that either deliberately include or avoid non-clinical signals. These splits cover demographic and imaging-protocol features such as magnetic field strength, vertical orientation, menopause status, race and ethnicity, and surgery type.

Files

baseline_large.zip

Files (10.6 GB)

Name Size Download all
md5:0642b392fa7cc62c33330309d76cd995
2.0 GB Preview Download
md5:1cae6509142689b71cc40cbd45bf0169
972.8 MB Preview Download
md5:84b37b8ae7e0e2ffdf297753bb55640a
1.6 GB Preview Download
md5:dbe61da7dc7b06c69c10dbbea0a13b40
905.9 MB Preview Download
md5:a2cba3f7b561e86ea5f44e94bad31559
992.3 MB Preview Download
md5:0c7b1342b37f4e5fa5cd8f4c2b1633cd
649.2 MB Preview Download
md5:41814776dfae37b44e020f2ef0da3ef0
803.0 MB Preview Download
md5:0c7f77e2cc2c51ce1914ee4a64fd4cab
2.7 GB Preview Download

Additional details