Dataset Open Access

Continental Europe Digital Terrain Model at 30 m resolution based on GEDI, ICESat-2, AW3D, GLO-30, EUDEM, MERIT DEM and background layers

Hengl, Tomislav; Leal Parente, Leandro; Krizan, Josip; Bonannella, Carmelo

Citation Style Language JSON Export

  "publisher": "Zenodo", 
  "DOI": "10.5281/zenodo.4724549", 
  "language": "eng", 
  "title": "Continental Europe Digital Terrain Model at 30 m resolution based on GEDI, ICESat-2, AW3D, GLO-30, EUDEM, MERIT DEM and background layers", 
  "issued": {
    "date-parts": [
  "abstract": "<p>Digital Terrain Model for Continental Europe based on the three publicly available Digital Surface Models and predicted using an Ensemble Machine Learning (EML). EML was trained using&nbsp;GEDI&nbsp;level 2B points (Level 2A; &quot;elev_lowestmode&quot;) and ICESat-2 (ATL08; &quot;h_te_mean&quot;): about 9 million points were overlaid vs <a href=\"\">MERITDEM</a>, <a href=\"\">AW3D30</a>, <a href=\"\">GLO-30</a>, <a href=\"\">EU DEM</a>, <a href=\"\">GLAD canopy height</a>, tree cover and surface water cover maps, then an ensemble prediction model (mlr package in R) was fitted using random forest, Cubist and GLM, and used to predict most probable terrain height (bare earth). Input layers used to train the EML include:</p>\n\n<ul>\n\t<li>&quot;lcv_bare.earth_glcf.landsat&quot;: UMD GLAD&nbsp;<a href=\"\">bare earth estimate</a> for year 2010 based on Landsat time series,</li>\n\t<li>&quot;dtm_elev.dsm_alos.aw3d&quot;: Digital Surface Model based on <a href=\"\">ALOS AW3D</a>,&nbsp;</li>\n\t<li>&quot;dtm_canopy.height_glad.umd&quot;: UMD GLAD <a href=\"\">canopy height for 2019</a> based on GEDI data,</li>\n\t<li>&quot;dtm_elev.dsm_eudem.eea&quot;: Copernicus&nbsp;<a href=\"\">EU DEM</a> based on the SRTM and ASTER DEMs,</li>\n\t<li>&quot;hyd_surface.water_jrc.gswe&quot;: <a href=\"\">JRC Global Surface Water Explorer</a>&nbsp;surface water probability based on the Landsat time-series,</li>\n\t<li>&quot;lcv_landcover.12_pflugmacher2019&quot;: land cover map of Europe at 30 m based on <a href=\"\">Pflugmacher et al. (2019)</a>,</li>\n\t<li>&quot;lcv_tree.cover_umd.landsat_2000&quot;: forest tree cover for year 2000 based on the <a href=\"\">Global Forest Change data</a>,</li>\n\t<li>&quot;lcv_tree.cover_umd.landsat_2010&quot;:&nbsp;forest tree cover for year 2010 based on the <a href=\"\">Global Forest Change data</a>,</li>\n</ul>\n\n<p>Detailed processing steps can be found <a href=\"\"><strong>here</strong></a>. 
Read more about the processing steps <a href=\"\"><strong>here</strong></a>.</p>\n\n<p>Training data set can be obtained in the file &quot;gedi_elev.lowestmode_2019_eumap.RDS&quot;. The initial linear model fitted using the four independent Digital Surface / Digital Terrain models shows:</p>\n\n<pre><code>Residuals:\n  Min       1Q   Median       3Q      Max \n-124.627   -1.097    0.973    2.544   59.324 \n \nCoefficients:\n  Estimate Std. Error t value Pr(&gt;|t|)    \n(Intercept)         -1.6220640  0.0032415  -500.4   &lt;2e-16 ***\n  eu_dem25m_          -0.1092988  0.0005531  -197.6   &lt;2e-16 ***\n  eu_AW3Dv2012_30m_    0.0933774  0.0005957   156.7   &lt;2e-16 ***\n  eu_GLO30_30m_        0.2637153  0.0006062   435.1   &lt;2e-16 ***\n  eu_MERITv1.0.1_30m_  0.7496494  0.0005009  1496.6   &lt;2e-16 ***\n  ---\n  Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1\n\nResidual standard error: 7.059 on 9588230 degrees of freedom\n(2046196 observations deleted due to missingness)\nMultiple R-squared:  0.9996,\tAdjusted R-squared:  0.9996 \nF-statistic: 5.343e+09 on 4 and 9588230 DF,  p-value: &lt; 2.2e-16</code></pre>\n\n<p>This shows that MERIT DEM (<a href=\"\">Yamazaki et al., 2019</a>) is the DEM most correlated with GEDI and ICESat-2, most likely because it has been systematically post-processed and the majority of canopy problems have been removed. Summary results of the model training (<a href=\"\">mlr::makeStackedLearner</a>) using all covariates (including canopy height, tree cover, bare earth cover) show:</p>\n\n<pre><code>Variable: elev_lowestmode.f \nR-square: 1 \nFitted values sd: 333 \nRMSE: 6.54 \n\nEnsemble model:\nCall:\nstats::lm(formula = f, data = d)\n\nResiduals:\n     Min       1Q   Median       3Q      Max \n-118.788   -0.871    0.569    1.956  165.119 \n\nCoefficients:\n             Estimate Std. 
Error t value Pr(&gt;|t|)    \n(Intercept) -0.198402   0.003045  -65.15   &lt;2e-16 ***\nregr.ranger  0.452543   0.001117  405.04   &lt;2e-16 ***\nregr.cubist  0.527011   0.001516  347.61   &lt;2e-16 ***\nregr.glm     0.020033   0.001217   16.47   &lt;2e-16 ***\n---\nSignif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1\n\nResidual standard error: 6.544 on 9588231 degrees of freedom\nMultiple R-squared:  0.9996,\tAdjusted R-squared:  0.9996 \nF-statistic: 8.29e+09 on 3 and 9588231 DF,  p-value: &lt; 2.2e-16</code></pre>\n\n<p>This indicates that the elevation errors are on average (2/3rd of pixels) between +1-2 m. The variable importance based on the Random Forest package ranger shows:</p>\n\n<pre><code>Variable importance:\n               variable   importance\n4   eu_MERITv1.0.1_30m_ 430641370770\n1     eu_AW3Dv2012_30m_ 291483345389\n2         eu_GLO30_30m_ 201517488587\n3            eu_dem25m_ 132742500162\n9 eu_canopy_height_30m_   5148617173\n7             bare2010_   2087304901\n8        treecover2000_   1761597272\n6        treecover2010_    141670217</code></pre>\n\n<p>The output predicted terrain model includes the following two layers:</p>\n\n<ul>\n\t<li>&quot;dtm_elev.lowestmode_gedi.eml_mf&quot;: mean estimate&nbsp;of the terrain elevation in dm (decimeters) filtered using a Gaussian filter and a 2x pixel window in SAGA GIS,</li>\n\t<li>&quot;dtm_elev.lowestmode_gedi.eml_md&quot;: standard deviation of the independently fitted stacked predictors quantifying the prediction uncertainty in dm (decimeters),</li>\n</ul>\n\n<p>The predicted elevations are based on the GEDI data hence the reference water surface (WGS84 ellipsoid) is about <a href=\"\">43 m higher</a> than the sea water surface for a specific EU country. 
Before modeling, we corrected the reference elevations to the Earth Gravitational Model 2008 (<strong>EGM2008</strong>) by using the <a href=\"\">5-arcdegree resolution correction surface</a> (<a href=\"\">Pavlis et al., 2012</a>).</p>\n\n<p>All GeoTIFFs were prepared using Integer format (elevations rounded to 1 m) and have been converted to <a href=\"\">Cloud Optimized GeoTIFFs using GDAL</a>.</p>\n\n<p><strong>Disclaimer</strong>: The output DTM still shows forest canopy (overestimation of the terrain elevation) and has not been hydrologically corrected for spurious sinks and similar artifacts. This data set is continuously updated. To report a bug or suggest an improvement, please visit <a href=\"\"><strong>here</strong></a>. To access DTM derivatives at 30-m, 100-m and 250-m please visit <a href=\"\"><strong>here</strong></a>. To register for updates please subscribe to: <a href=\"\"></a>.</p>", 
  "author": [
      "family": "Hengl, Tomislav"
      "family": "Leal Parente, Leandro"
      "family": "Krizan, Josip"
      "family": "Bonannella, Carmelo"
  "note": "This work has received funding from the European Union's Innovation and Networks Executive Agency (INEA) under Grant Agreement Connecting Europe Facility (CEF) Telecom project 2018-EU-IA-0095 (", 
  "version": "v0.3", 
  "type": "dataset", 
  "id": "4724549"
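
The "Ensemble model" summary in the abstract reports a linear meta-learner (stats::lm) that combines the three base learners (regr.ranger, regr.cubist, regr.glm). A minimal Python sketch of that final combination step is given below, using the coefficients printed in the summary. The function name, the dictionary layout and the example pixel values are hypothetical and are not part of the published workflow, which was fitted with the mlr package in R.

```python
# Coefficients copied from the "Ensemble model" lm summary in the abstract.
META_COEF = {
    "intercept": -0.198402,
    "regr.ranger": 0.452543,
    "regr.cubist": 0.527011,
    "regr.glm": 0.020033,
}


def stack_predict(ranger, cubist, glm, coef=META_COEF):
    """Combine base-learner predictions with a linear meta-learner.

    All three inputs must be in the same unit as the training target
    (elev_lowestmode); the result is in that unit as well.
    This helper is an illustrative assumption, not the authors' code.
    """
    return (
        coef["intercept"]
        + coef["regr.ranger"] * ranger
        + coef["regr.cubist"] * cubist
        + coef["regr.glm"] * glm
    )


if __name__ == "__main__":
    # Hypothetical base-learner predictions for a single pixel.
    print(stack_predict(120.0, 121.0, 118.0))
```

The three slope coefficients sum to roughly 1, so the stacked prediction behaves much like a weighted average of the base learners, with random forest and Cubist carrying almost all of the weight.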
                  All versions   This version
Views             1,195          428
Downloads         421            73
Data volume       1.9 TB         298.1 GB
Unique views      1,023          362
Unique downloads  205            32

