Published December 14, 2020 | Version 1.0
Dataset Open

GeoVectors-Antarctica-tags (v1.0)

  • 1. L3S Research Center, Leibniz University Hannover, Germany
  • 2. Data Science & Intelligent Systems Group (DSIS), University of Bonn, Germany

Description

Description

The GeoVectors corpus is a comprehensive large-scale linked open corpus of OpenStreetMap (https://www.openstreetmap.org/) entity embeddings that provides latent representations of over 980 million entities. The GeoVectors capture the semantic and geographic dimensions of OpenStreetMap entities and make them directly accessible to machine learning applications. The "-tags" datasets provide embeddings that capture the semantic dimension of OpenStreetMap entities. The "-location" datasets provide the geographic dimension.

Contents

This dataset was derived from an OpenStreetMap snapshot that was taken on November 10, 2020 (© OpenStreetMap contributors).

We provide the GeoVectors in region-specific subsets. This subset contains tag-embeddings for the region "Antarctica" including the following countries:

  • Antarctica

File format

The embeddings are provided in the tab-separated values (tsv) format. Each row contains the embedding of a single OpenStreetMap entity. The first column contains the OpenStreetMap type and the second column contains the OpenStreetMap ID of the respective entity. The type can either be node (n), way (w), or relation (r). The remaining columns represent the dimensions of the embedding space. (See also header.tsv)

Further information:

For further information, please visit http://geovectors.l3s.uni-hannover.de

Funding:

This work was partially funded by DFG, German Research Foundation (“WorldKG", DE 2299/2-1), the Federal Ministry of Education and Research (BMBF), Germany (“Simple-ML", 01IS18054), the Federal Ministry for Economic Affairs and Energy (BMWi), Germany (“d-E-mand", 01ME19009B), and the European Commission (EU H2020, “smashHit", grant-ID 871477).

Files

Files (28.0 MB)

Name Size Download all
md5:517f5f8d7fae2c39956fa9266bde93e1
28.0 MB Download
md5:aeac30cf1c385adb69ab0e962ebb3efd
1.1 kB Download