SROADEX: Dataset for binary recognition and semantic segmentation of road surface areas from high resolution Aerial Orthoimages Covering Approximately 8,650 km2 of the Spanish Territory Tagged with Road Information
Description
The data have been generated using scripts developed in Python using Open Source libraries (GDAL/OGR and MapScript) for rasterization of vector cartography representing the axes of the different types of roads (urban, interurban and rural). This cartography has been obtained from different Spanish official sources (National Geographic Institute and autonomic cartographic agencies) that we have revised and edited in a meticulous and systematic way to verify that the roads are represented on the cartography according to the orthoimages, available on January 1, 2021 in the download center of the National Center of Geographic Information (CNIG), on 16 rectangular areas (28,5 km * 18,5 km) of the Spanish territory (insular and peninsular).
The dataset consists of 777599 images in png format of 256x256 pixels, organized in folders for the different trainings, separating those corresponding to training, testing and validation.
The structure of the data is as follows:
1-Road-Ortho and 1-Road-Mask contain the images and ground true for training the semantic segmentation networks.
1-Road-Ortho and 2-NoRoad-Ortho contain aerial images containing or not containing vials, for the training of binary tessellation networks identifying tessellations with vials.
Moreover, in each folder the structure is the same: train, test, validation containing 90%, 5% and 5% of the total images and masks of each type.
1-Road-Ortho
|----Train
|----Test
-----Validation
1-Road-Mask
|----Train
|----Test
-----Validation
2-NoRoad-Ortho
|----Train
|----Test
-----Validation