Published March 8, 2022 | Version v1
Dataset Open

Long-term series of daily snow depth dataset over the Northern Hemisphere based on machine learning (1980-2019)

  • 1. Northwest Institute of Eco-Environment and Resources
  • 2. College of Forestry, Sichuan Agricultural University

Description

Long-term series of daily snow depth dataset over the Northern Hemisphere based on machine learning (1980-2019)

 

Supported by the Strategic Priority Research Program of the Chinese Academy of Science (XDA19070100). Tao Che, the director of this program, who comes from Key Laboratory of Remote Sensing of Gansu Province, Northwest Institute of Eco-Environment and Resources, CAS. They used machine learning methods combined with multi-source gridded snow depth product data to derive a long-time series over the Northern Hemisphere.

Firstly, the applicability of artificial neural network (ANN), support vector machine (SVM) and random forest (RF) method in snow depth fusion are compared. It is found that random forest method shows strong advantages in snow depth data fusion. Secondly, using the random forest method, combined with remote sensing snow depth products such as AMSR-E, AMSR-2, NHSD and GlobSnow and reanalysis data such as ERA-Interim and MERRA-2. These gridded snow depth products and environmental factor variables are used as the input independent variables of the model. In situ observations of China Meteorological Station (945), Russia Meteorological Station (620), Russian snow survey data (514), and global historical meteorological network (41261) are used as reference truth to train and verify the model. The daily gridded snow depth dataset of the snow hydrological year from 1980 to 2019 (September 1 of the previous year to May 31 of the current year) is prepared on the cloud platform provided by the CASEarth. Since the passive microwave brightness temperature data from 1980 to 1987 is the data of every other day, there will be a small number of missing trips in the data during this period. Using the ESM-SnowMIP and independent ground observation data for verification, the quality of the fusion data set has been improved. According to the comparison between the ground observation data and the snow depth products before fusion, the determination coefficient (R2) of the fusion data is increased from 0.23 (GlobSnow snow depth product) to 0.81, and the corresponding root mean square error (RMSE) and mean absolute error (MAE) are also reduced to 7.7 cm and 2.7 cm.

 

File naming and required software

 

The daily snow depth data is a matrix with 1440 columns and 360 rows, and the spatial resolution of the data is 0.25 °, NoData_Value is represented by - 9999. The snow depth datasets are stored according to the natural year folder by folder, the folder of each year contains the daily fused gridded snow depth dataset of that year. The data format of fused snow depth dataset is stored in GeoTiff format, the unit of snow depth data is centimeter, and the projection is WGS84. Among these years, the daily fused snow depth dataset from September 1, 1980 to December 31 of that year were contained in folder of 1980, and the folder of 2019 contains the daily fused snow depth dataset from 1 January 1, 2019 to May 31 of that year. Except for these two years, al folders contain the fused snow depth dataset from January 1 to May 31 and September 1 to December 31. Because of the quality of the original gridded snow depth datasets, daily snow depth datasets from June 1 to August 31 were not considered in our study. The file name is "ML_NHSD_YYYY-MM-DD.tif", where ML stands for machine learning, NHSD stands for Northern Hemisphere Snow Depth, YYYY stands for the natural years from 1980 to 2019, MM stands for January to December (except June to August), and DD stands for the date. The file can be viewed with software such as ArcMAP or QGIS and processed accordingly.

The original gridded snow depth datasets of NHSD and GlobSnow were produced with the passive microwave brightness temperature from SMMR sensor in 1980 to 1987. The SMMR dataset was the other day dataset and with large missing bands. Snow depth dataset of NHSD and GlobSnow were also inevitably have a large number of missing data, and the fused snow depth dataset have a similar situation. The fused snow depth is as complete as possible for the land area of the Northern Hemisphere, there are minor missing data on some days.   

Files

ML_NHSD_19802019_Geotiff.zip

Files (3.0 GB)

Name Size Download all
md5:10a4a044b6e462dc5c8a8541126b4bed
3.0 GB Preview Download
md5:8e5fbbd06b0ab3140259b1a28b921c38
199.7 kB Download
md5:36707fa4d0504482dd503a14d32335b0
17.6 kB Download