Published September 9, 2021 | Version v1
Journal article Open

Diagnosis of cadmium contamination in urban and suburban soils using visible-to-near-infrared spectroscopy

  • 1. School of Resource and Environmental Sciences, Wuhan University; Department of Environment, Ghent University
  • 2. School of Resource and Environmental Sciences, Wuhan University
  • 3. Hubei Academy of Environmental Sciences, Wuhan
  • 4. Institute of Agricultural Remote Sensing and Information Technology Application, College of Environmental and Resource Sciences, Zhejiang University
  • 5. School of Remote Sensing and Information Engineering, Wuhan University
  • 6. College of Resources and Environment, Huazhong Agricultural University, Wuhan
  • 7. South China Institute of Environmental Sciences, Ministry of Ecology and Environment, Guangzhou
  • 8. Department of Environment, Ghent University

Description

Previous studies have mostly focused on using visible-to-near-infrared spectral technique to quantitatively estimate soil cadmium (Cd) content, whereas little attention has been paid to identifying soil Cd contamination from a perspective of spectral classification. Here, we developed a framework to compare the potential of two spectral transformations (i.e., raw reflectance and continuum removal [CR]), three optimization strategies (i.e., full-spectrum, Boruta feature selection, and synthetic minority over-sampling technique [SMOTE]), and three classification algorithms (i.e., partial least squares discriminant analysis, random forest [RF], and support vector machine) for diagnosing soil Cd contamination. A total of 536 soil samples were collected from urban and suburban areas located in Wuhan City, China. Specifically, Boruta and SMOTE strategies were aimed at selecting the most informative predictors and obtaining balanced training datasets, respectively. Results indicated that soils contaminated by Cd induced decrease in spectral reflectance magnitude. Classification models developed after Boruta and SMOTE strategies out-performed to those from full-spectrum. A diagnose model combining CR preprocessing, SMOTE strategy, and RF algorithm achieved the highest validation accuracy for soil Cd (Kappa = 0.74). This study provides a theoretical reference for rapid identification of and monitoring of soil Cd contamination in urban and suburban areas.

Files

1-s2.0-S0269749121017103-main_SI.pdf

Files (4.6 MB)

Name Size Download all
md5:3887a2229ea0e52b9983b3c3b2d33b01
4.6 MB Preview Download