Published December 8, 2015 | Version v1
Dataset Open

Data from: Computational pathology to discriminate benign from malignant intraductal proliferations of the breast

Description

The categorization of intraductal proliferative lesions of the breast based on routine light microscopic examination of histopathologic sections is in many cases challenging, even for experienced pathologists. The development of computational tools to aid pathologists in the characterization of these lesions would have great diagnostic and clinical value. As a first step to address this issue, we evaluated the ability of computational image analysis to accurately classify DCIS and UDH and to stratify nuclear grade within DCIS. Using 116 breast biopsies diagnosed as DCIS or UDH from the Massachusetts General Hospital (MGH), we developed a computational method to extract 392 features corresponding to the mean and standard deviation in nuclear size and shape, intensity, and texture across 8 color channels. We used L1-regularized logistic regression to build classification models to discriminate DCIS from UDH. The top-performing model contained 22 active features and achieved an AUC of 0.95 in cross-validation on the MGH data-set. We applied this model to an external validation set of 51 breast biopsies diagnosed as DCIS or UDH from the Beth Israel Deaconess Medical Center, and the model achieved an AUC of 0.86. The top-performing model contained active features from all color-spaces and from the three classes of features (morphology, intensity, and texture), suggesting the value of each for prediction. We built models to stratify grade within DCIS and obtained strong performance for stratifying low nuclear grade vs. high nuclear grade DCIS (AUC = 0.98 in cross-validation) with only moderate performance for discriminating low nuclear grade vs. intermediate nuclear grade and intermediate nuclear grade vs. high nuclear grade DCIS (AUC = 0.83 and 0.69, respectively). These data show that computational pathology models can robustly discriminate benign from malignant intraductal proliferative lesions of the breast and may aid pathologists in the diagnosis and classification of these lesions.

Notes

Files

_SelectedFeatures.csv

Files (1.6 GB)

Name Size Download all
md5:7a56321bbc030d0487c4e3f15eeb871f
1.2 kB Preview Download
md5:dbd023d84197fc5cf18ae4cd3a8ed7a0
13.4 kB Download
md5:97ea4d6682b94f34ce9be2ee753b76f6
92.2 kB Download
md5:0ad49d4149337a08d6df805fb2900231
576.9 MB Download
md5:f5e4e11f8b1b6d86afbd08d2aacc22b7
732.3 kB Preview Download
md5:4714b00e1a447cd4b49270ffc0b1e7ce
1.7 MB Download
md5:38d9eab08421ee363e876f7007d2ff4a
1.6 kB Download
md5:5d95676c9fd8e88f11e9894737f8768c
968.6 MB Preview Download
md5:c04ffa535a37742ec8917babf6eaea39
13.0 MB Preview Download
md5:d9543b50395122f5888e3736b450f251
31.8 kB Download

Additional details

Related works