Published December 15, 2021 | Version 1.0.4
Dataset Open

VC-SLAM Versatile Corpus for Semantic Labeling And Modeling

  • 1. University of Wuppertal

Description

Benchmark Corpus for semantic labeling and modeling.

This corpus contains 101 data sets from different open data portals.
Each data set consists of the following data:

  • Raw csv data [rawdata_csv]
  • Large json data sample [json_sample_large]
  • Small json data sample [json_sample_small]
  • Raw data in csv format [rawdata_csv]
  • Raw data samples in csv format [rawdata_csv_samples]
  • Mappings to translate between csv and json files [csv_json_mappings]
  • Textual description / Metadata [descriptions]
  • Semantic model as rdf/ttl [semantic_models]
  • Mappings describing mapping between raw data attributes and concepts from the ontology [mappings]
  • List of attributes that have been ignored during modeling [ignored_attributes]

Additionally the corpus contains a target ontology as rdf/ttl [ontology].

The individual data sets are licensed by the licenses specified in the attached Excel sheet (DataSetOverview.xlsx)

 

These data are provided "as is", without any warranties of any kind. The data are provided under the Creative Commons Attribution 4.0 International license.

Files

vc-slam.zip

Files (1.6 GB)

Name Size Download all
md5:9aaeb01108bf04edad228d3b5093795d
35.5 kB Download
md5:b0ca1cb8c162d10a112361cdd58a68a6
1.6 GB Preview Download