Learning Aerial Image Segmentation From Online Maps

doi:10.1109/TGRS.2017.2719738

Published July 21, 2017 | Version v1

Dataset Open

Learning Aerial Image Segmentation From Online Maps

1. ETH Zurich

This is the CITY-OSM dataset used in the journal publication "Learning Aerial Image Segmentation From Online Maps".

Paper abstract:

This paper deals with semantic segmentation of high-resolution (aerial) images where a semantic class label is assigned to each pixel via supervised classification as a basis for automatic map generation. Recently, deep convolutional neural networks (CNNs) have shown impressive performance and have quickly become the de-facto standard for semantic segmentation, with the added benefit that task-specific feature design is no longer necessary. However, a major downside of deep learning methods is that they are extremely data hungry, thus aggravating the perennial bottleneck of supervised classification, to obtain enough annotated training data. On the other hand, it has been observed that they are rather robust against noise in the training labels. This opens up the intriguing possibility to avoid annotating huge amounts of training data, and instead train the classifier from existing legacy data or crowd-sourced maps that can exhibit high levels of noise. The question addressed in this paper is: can training with large-scale publicly available labels replace a substantial part of the manual labeling effort and still achieve sufficient performance? Such data will inevitably contain a significant portion of errors, but in return virtually unlimited quantities of it are available in larger parts of the world. We adapt a state-of-the-art CNN architecture for semantic segmentation of buildings and roads in aerial images, and compare its performance when using different training data sets, ranging from manually labeled pixel-accurate ground truth of the same city to automatic training data derived from OpenStreetMap data from distant locations. We report our results that indicate that satisfying performance can be obtained with significantly less manual annotation effort, by exploiting noisy large-scale training data.

Files

berlin.zip

Files (23.8 GB)

Name	Size	Download all
berlin.zip md5:cc721863b953244ce171553a79900542	2.0 GB	Preview Download
chicago.zip md5:81444e9395fbc0325a419062f5f65224	5.4 GB	Preview Download
paris.zip md5:8d9816e5151fa93b0717d13d5296e944	11.0 GB	Preview Download
potsdam.zip md5:6395903b542bc2405902bf9c60cda6f2	472.1 MB	Preview Download
README.md md5:5cb0397a0bf65502532aeaa7d7221954	1.7 kB	Preview Download
tokyo.zip md5:04c44ba791c8f92f143605d668234476	9.6 MB	Preview Download
zurich.zip md5:a86123f6e1cfdeee818d8962af6dab9f	4.9 GB	Preview Download

	All versions	This version
Views	19,712	19,641
Downloads	15,622	15,577
Data volume	237.1 TB	236.5 TB

Learning Aerial Image Segmentation From Online Maps

Creators

Description

Files

berlin.zip

Files (23.8 GB)