Journal article Open Access
D. Ivanov; V. Dremin; T. Genova; A. Bykov; T. Novikova; R. Ossikovski; I. Meglinski
In biophotonics, novel techniques and approaches are being constantly sought to assist medical doctors and to increase both sensitivity and specificity of the existing diagnostic methods. In such context, tissue polarimetry holds promise to become a valuable optical diagnostic technique as it is sensitive to tissue alterations caused by different benign and malignant formations. In our studies, multiple Mueller matrices were recorded for formalinfixed, human, ex vivo colon specimens containing healthy and tumor zones. The available data were pre-processed to filter noise and experimental errors, and then all Mueller matrices were decomposed to derive polarimetric quantities sensitive to malignant formations in tissues. In addition, the Poincaré sphere representation of the experimental results was implemented. We also used the canonical and natural indices of polarimetric purity depolarization spaces for plotting our experimental data. A feature selection was used to perform a statistical analysis and normalization procedure on the available data, in order to create a polarimetric model for colon cancer assessment with strong predictors. Both unsupervised (principal component analysis) and supervised (logistic regression, random forest, and support vector machines) machine learning algorithms were used to extract particular features from the model and for classification purposes. The results from logistic regression allowed to evaluate the best polarimetric quantities for tumor detection, while the use of random forest yielded the highest accuracy values. Attention was paid to the correlation between the predictors in the model as well as both losses and relative risk of misclassification. Apart from the mathematical interpretation of the polarimetric quantities, the presented polarimetric model was able to support the physical interpretation of the results from previous studies and relate the latter to the samples’ health condition, respectively.