Journal article Open Access
The extensive amount of Earth observation satellite images available brings opportunities and challenges for land mapping in global and regional scales. These large data sets have motivated the use of satellite image time series analysis coupled with machine learning techniques to produce land use and cover class maps. To be successful, these methods need good quality training samples, which are the most important factor for determining the accuracy of the results. For this reason, training samples need methods for quality control of class noise. In this paper, we propose a method to assess and improve the quality of satellite image time series training data. The method uses self-organizing maps (SOM) to produce clusters of time series and Bayesian inference to assess intra-cluster and inter-cluster similarity. Consistent samples of a class will be part of a neighborhood of clusters in the SOM map. Noisy samples will appear as outliers in the SOM. Using Bayesian inference in the SOM neighborhoods, we can infer which samples are noisy. To illustrate the methods, we present a case study in a large training set of land use and cover classes in the Cerrado biome, Brazil. The results prove that the method is efficient to reduce class noise and to assess the spatio-temporal variation of satellite image time series training samples.
Santos et al. - 2021 - Quality control and class noise reduction of satel.pdf