Published December 15, 2020 | Version 1.0
Dataset Open

GeoVectors-Europe-East-location (v0.1)

  • 1. L3S Research Center, Leibniz University Hannover, Germany
  • 2. Data Science & Intelligent Systems Group (DSIS), University of Bonn, Germany

Description

Description

The GeoVectors corpus is a comprehensive large-scale linked open corpus of OpenStreetMap (https://www.openstreetmap.org/) entity embeddings that provides latent representations of over 980 million entities. The GeoVectors capture the semantic and geographic similarities of OpenStreetMap entities and make them directly accessible to machine learning applications. The "-tags" datasets provide embeddings that capture the semantic similarities of OpenStreetMap entities. The "-location" datasets provide the geographic similarities.

Contents

This dataset was derived from an OpenStreetMap snapshot that was taken on November 10, 2020 (© OpenStreetMap contributors).

We provide the GeoVectors in region-specific subsets. This subset contains location-embeddings for the region "Europe-East" including the following countries:

  • Albania
  • Belarus
  • Bosnia-Herzegovina
  • Bulgaria
  • Croatia
  • Cyprus
  • Czech-Republic
  • Estonia
  • Finland
  • Georgia
  • Greece
  • Hungary
  • Iceland
  • Kosovo
  • Latvia
  • Lithuania
  • Macedonia
  • Moldova
  • Montenegro
  • Poland
  • Romania
  • Serbia
  • Slovakia
  • Slovenia
  • Sweden
  • Turkey
  • Ukraine

File format

The embeddings are provided in the tab-separated values (tsv) format. Each row contains the embedding of a single OpenStreetMap entity. The first column contains the OpenStreetMap type and the second column contains the OpenStreetMap id of the respective entity. The type can either be node (n), way (w), or relation (r). The remaining columns represent the dimensions of the embedding space. (See also header.tsv)

Further information:

For further information, please visit http://geovectors.l3s.uni-hannover.de

Funding:

This work was partially funded by DFG, German Research Foundation (“WorldKG", DE 2299/2-1), the Federal Ministry of Education and Research (BMBF), Germany (“Simple-ML", 01IS18054), the Federal Ministry for Economic Affairs and Energy (BMWi), Germany (“d-E-mand", 01ME19009B), and the European Commission (EU H2020, “smashHit", grant-ID 871477).

Files

Files (47.6 GB)

Name Size Download all
md5:716a7688fb70a1e102820d59d248aa46
230.8 MB Download
md5:865c97c9d1c3742e66d2cd106182cea2
2.2 GB Download
md5:cbda27afcf8fd841aad9ba292b54615f
434.2 MB Download
md5:dfe1afb4c791cb6437d33e3da346e8ed
648.8 MB Download
md5:bbb9729f3ec4b1d633382c8b2289cb6f
840.5 MB Download
md5:59518f8bed525505dbae22a09e94ae43
121.9 MB Download
md5:e1ad885b43a4981a724fbcaa12d86ef7
5.6 GB Download
md5:b4d73ed386cc9407dac0fa5c2b752729
634.7 MB Download
md5:8a331db3a4d30a767e1e317fa8989d6a
3.3 GB Download
md5:af32f58bced7fb6e0303d12522c58cf4
262.2 MB Download
md5:5c3c250f71f2316d0820fef74ef8f74a
1.1 GB Download
md5:26381b842a88278272c49de0c8c04188
298 Bytes Download
md5:cdd3332fc02af56c5cf22bfc9bafd116
1.7 GB Download
md5:2786b0e8eb5c71f740b53a159b3b19b9
242.1 MB Download
md5:f0cb13388a6d9399519290e99d7bd700
224.4 MB Download
md5:e70cba55597ea2a4008c34f663d2d253
582.2 MB Download
md5:41e6a6c1b42c5c660187b494909d5d38
1.0 GB Download
md5:8cf0b8bd980c29cfe45f28c1d9ae0359
81.1 MB Download
md5:035a353e13b959c60bd4cdcd812bab85
457.0 MB Download
md5:db518787059c7db2c93cf8ffec174634
102.6 MB Download
md5:f612d9368aa417603d96b6d68dc0f204
12.8 GB Download
md5:1f4a1aa0b9a4fea92b05e6174dc6d71d
1.4 GB Download
md5:9956cb6e48993f972abd95da6ad4709f
691.2 MB Download
md5:d2960f93daa8d29b02e2ba394b27fd37
1.8 GB Download
md5:5ec502816abf5b81265723067eed8724
1.1 GB Download
md5:5323450b1f3f75279631ed236c4db4ab
3.1 GB Download
md5:bf17532b638f9fa8abb3717995afff55
2.1 GB Download
md5:7d8a364d44fd9fbd139b661259982c9d
4.9 GB Download