Published April 8, 2026 | Version v1
Dataset Open

Spatial dataset from the work "Bridging soil mapping units and digital soil mapping through probability-driven machine learning: an uncertainty-aware framework for soil class prediction in Antarctica"

  • 1. ROR icon Universidade Federal de Viçosa
  • 2. ROR icon Universidade de São Paulo
  • 3. ROR icon Universidade Federal do Rio de Janeiro

Description

 

This spatial dataset supports the results presented in the scientific article Bridging soil mapping units and digital soil mapping through probability-driven machine learning: an uncertainty-aware framework for soil class prediction in Antarctica”, currently under review in Catena. It includes spatial predictions of soil classes, final soil mapping unit–structured soil maps , and intermediate outputs generated within the modeling framework

The files are organized as follows:

  • probability_maps_per_soil_class.zip: The first set of spatial data generated consists of 19 raster files (.TIF), representing continuous probability maps for each soil class across the study area. These maps correspond to median probabilities derived from 100 model runs, with values ranging from 0 to 1.
  • probable_soil_classes.zip: This archive contains five raster files (.TIF) representing the ranked soil classes at each pixel, ranging from the most probable to the fifth most probable soil class. Each raster contains discrete values from 1 to 19, where these numbers represent different soil classes, as listed below:
  1. Anthorthents / Anhytrubels
  2. Aquiturbels
  3. Aquorthels
  4. Cryopsamments
  5. Dystrogelepts / Humigelepts
  6. Fibrists / Saprists
  7. Gelaquents / Psammaquents
  8. Gelaquepts / Petraquepts
  9. Gelifluvents
  10. Gelorthents
  11. Haplogelepts
  12. Haplohemists / Cryofibrists
  13. Haplorthels
  14. Haploturbels
  15. Psammorthels
  16. Psammoturbels
  17. Rocky Outcrops
  18. Umbrothels / Umbriturbels
  19. Vitrigelands
  • probability_probable_soil_classes.zip: Five raster files (.TIF) representing the probability values associated with the ranked soil classes from the most probable to the fifth most probable soil maps.
  • final_maps.zip: Vector files in SHP format (polygons) containing the final soil maps structured by mapping units, generated through the integration of the intermediate maps described above (see methodology in the forthcoming article). The files include a complete attribute table with soil class names, corresponding codes (1–19), and polygon areas. Mapping units may contain from one up to five soil classes (components).
  • The file soil_mapping_raw_with_probs also includes the mean probabilities and standard deviation for each component. The file soil_mapping_units_aggregated represents the dissolved mapping units, while files ending in _1000, _2500, and _5000 correspond to generalized mapping units for these scales, based on the USDA minimum mappable area concept.
  • number_classes_per_mapping_unit.zip: At last, this archive contains a single raster file (.TIF) with discrete numerical values ranging from 1 to 5, representing the number of components (soil classes) within each mapping unit.

Files

final_maps.zip

Files (1.7 GB)

Name Size Download all
md5:a333890c88f54f56ba4dcb1f40e850d2
314.7 MB Preview Download
md5:dff49d60887bd6f924d35c207d4de19d
3.6 MB Preview Download
md5:5359d4ed0d81b410536560e3abd8556d
879.8 MB Preview Download
md5:71856627caf584d8df4d504dbcc6f352
316.3 MB Preview Download
md5:e2d1a4fe96e65982bdcda93165799f73
151.7 MB Preview Download