Published June 26, 2023 | Version v1
Software Open

Land cover classification and mapping of a polar desert in the Canadian Arctic Archipelago

Description

The use of remote sensing for developing land cover maps in the Arctic has grown considerably in the last two decades, especially for monitoring the effects of climate change. The main challenge is to link information extracted from satellite imagery to ground covers due to the fine-scale spatial heterogeneity of Arctic ecosystems. There is currently no commonly accepted methodological scheme for high-latitude land cover mapping, but the use of remote sensing in Arctic ecosystem mapping would benefit from a coordinated sharing of lessons learned and best practices. Here, we aimed to produce a highly accurate land cover map of the surroundings of the Canadian Forces Station Alert, a polar desert on the northeastern tip of Ellesmere Island (Nunavut, Canada) by testing different predictors and classifiers. To account for the effect of the bare soil background and water limitations that are omnipresent at these latitudes, we included as predictors soil-adjusted vegetation indices and several hydrological predictors related to waterbodies and snowbanks. We compared the results obtained from an ensemble classifier based on a majority voting algorithm to eight commonly used classifiers. The distance to the nearest snowbank and soil-adjusted indices were the top predictors allowing the discrimination of land cover classes in our study area. The overall accuracy of the classifiers ranged between 75 and 88%, with the ensemble classifier also yielding a high accuracy (85%) and producing less bias than the individual classifiers. Some challenges remained, such as shadows created by boulders and snow covered by soil material. We provide recommendations for further improving classification methodology in the High Arctic, which is important for the monitoring of Arctic ecosystems exposed to ongoing polar amplification.

Notes

The dataset includes a shapefile named reference.shp, which consists of a collection of files with a common filename prefix, stored in the same directory. The shapefile stores the location, shape (point in this case), and attribute of the 467 reference points. The attribute of each point includes the land cover class among the following: bareground (refers to forb-dominated barren), mesic (refers to forb-dominated tundra), wetgrass (refers to grass-dominated wetland), wetsedge (refers to sedge-dominated wetland), wetmoss (refers to moss-dominated wetland), water, and snow. The shapefile can be opened in geographic information system (GIS) software such as QGIS (QGIS Development Team) and ArcGIS (ESRI). The geographic coordinate system is NAD 1983 (EPSG:4269) and the projected coordinate system is NAD 1983 UTM Zone 20N (EPSG:26920).

The dataset also includes a file named ensemble_classifier.tif. It is a raster GIS file in GeoTIFF format with 0.5 x 0.5 meter resolution. The uncompressed size is 382,48 MB. It can be opened in GIS software such as QGIS and ArcGIS. The geographic coordinate system is NAD 1983 (EPSG:4269) and the projected coordinate system is NAD 1983 UTM Zone 20N (EPSG:26920). Dimensions of the raster are 26870 rows x 29852 columns. The values in the raster represent the land cover classes: 1 = forb-dominated barren, 2= forb-dominated tundra, 3= snow, 4= water, 5= grass-dominated wetland, 6= moss-dominated wetland, 7= sedge-dominated wetland, 8= human infrastructure, and 9 = shadow (NoData value = 15).

The two other files in this dataset are R scripts that can be opened in RStudio software (RStudio Team). One script, named selection_classification_validation.R contains the different steps necessary to select the most relevant predictors prior to classification, to train the seven nonparametric classifiers (Artificial Neural Networks, Classification And Regression Trees, K-Nearest Neighbors, Linear Discriminant Analysis, Naive Bayes, Random Forests, and Support Vector Machines), and to evaluate the accuracy of the resulting classifications. The second script, named majority_voting.R, is the ensemble classifier with majority voting algorithm in which classified maps (rasters in geotiff format) of four of the classifiers (Random Forests, Linear Discriminant Analysis, Classification And Regression Trees, Maximum Likelihood) are combined. Note that the classified maps for the three non-parametric classifiers were created from the first script, while the classified map from the parametric classifier, Maximum Likelihood, was created using ArcGIS Pro.

Funding provided by: Natural Sciences and Engineering Research Council of Canada
Crossref Funder Registry ID: http://dx.doi.org/10.13039/501100000038
Award Number: RGPIN-2019-05292

Funding provided by: Natural Sciences and Engineering Research Council of Canada
Crossref Funder Registry ID: http://dx.doi.org/10.13039/501100000038
Award Number: RGPNS-2019-305531

Funding provided by: Canada Excellence Research Chairs, Government of Canada
Crossref Funder Registry ID: http://dx.doi.org/10.13039/501100002784
Award Number:

Funding provided by: Department of National Defence
Crossref Funder Registry ID: http://dx.doi.org/10.13039/501100018825
Award Number:

Funding provided by: Kenneth M. Molson Foundation
Crossref Funder Registry ID: http://dx.doi.org/10.13039/501100023256
Award Number:

Funding provided by: Fonds de recherche du Québec – Nature et technologies
Crossref Funder Registry ID: http://dx.doi.org/10.13039/501100003151
Award Number:

Funding provided by: Network of Centers of Excellence of Canada ArcticNet*
Crossref Funder Registry ID:
Award Number:

Funding provided by: Weston Family Foundation
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100019889
Award Number:

Funding provided by: Polar Knowledge Canada
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100012258
Award Number:

Funding provided by: BIOS2 NSERC Collaborative Research and Training Experience (CREATE) program*
Crossref Funder Registry ID:
Award Number: FONCER 509948-2018

Files

Files (26.2 kB)

Name Size Download all
md5:2852a127d5301238f8fb7e06a5b73e16
12.5 kB Download
md5:507d654fc5b5e1bd23f1e1919dcaa3de
13.7 kB Download

Additional details

Related works

Is cited by
10.1080/11956860.2021.1907974 (DOI)
10.3390/rs15123090 (DOI)
Is source of
10.5061/dryad.3bk3j9kpk (DOI)
Is supplemented by
10.7910/DVN/OHHUKH (DOI)