Published April 27, 2023 | Version v1
Dataset Open

Three datasets of global monthly gross primary productivity (GPP) during 2003-2018 derived from SIF, NIRv and LAI and their best-matching environmental factors

  • 1. Peking University

Description

As the largest source of uncertainty in carbon cycle studies, accurate quantification of gross primary productivity (GPP) is critical for the global carbon budget in the context of global climate change. Numerous remote sensing vegetation indices (VIs) have participated in the estimation of global GPP. However, the relative performance of various VIs in estimating GPP and what additional factors should be combined with them to reveal the photosynthetic capacity of vegetation mechanistically better are still poorly understood.

We used the Random Forest (RF) algorithm to identify the factors with the most powerful explanation of GPP and to explore the importance of these predictors. We trained six RF models to select features, i.e., two types of models (Plant Functional Type [PFT]-specific and universal) for each vegetation index (SIF, NIRv, and LAI). Each model comprised 100 decision trees, was sampled without replacement, and was trained using 70% of the data. Model performance was evaluated using out-of-bag (OOB) R-squared (R2) and root mean square error (RMSE) values. The predictor with the lowest importance score in the iteration was removed and the whole procedure was then repeated until only the vegetation index, CO2, and PFTs were left. The predictors used to estimate GPP were identified based on the performance curve of OOB R2 and RMSE. The determination of the model is based on the principle that further reductions in the number of predictors would considerably reduce model performance, while increasing the number of predictors would not significantly improve model performance.

Here we provide a set of high-spatial resolution (1/12°) global gridded products of monthly GPP for 2003-2018 generated for each vegetation index based on a generic model with an optimal configuration, i.e., an optimal combination of VI and other relevant variables using the RF algorithm. R2 of three optimal VI-based GPP estimation models ranges from 0.84 to 0.85, and RMSE ranges from 1.51g C·m−2·d−1 to 1.54g C·m−2·d−1. More information about the datasets can be found in Zhao and Zhu (2022) Remote Sensing.

Zhao W, Zhu Z. Exploring the Best-Matching Plant Traits and Environmental Factors for Vegetation Indices in Estimates of Global Gross Primary Productivity[J]. Remote Sensing, 2022, 14(24): 6316.

Files

GOSIF_RF_GPP.zip

Files (6.7 GB)

Name Size Download all
md5:88e12c2a77da443abf8bd2f30c3697e7
2.3 GB Preview Download
md5:7c8d1da6ed5b99564c20520f412bd2a5
2.3 GB Preview Download
md5:fc59ecf199ad716ac04506045466f033
2.2 GB Preview Download

Additional details

References

  • Zhao W, Zhu Z. Exploring the Best-Matching Plant Traits and Environmental Factors for Vegetation Indices in Estimates of Global Gross Primary Productivity[J]. Remote Sensing, 2022, 14(24): 6316.