Dataset Open Access

Data Set Knowledge Graph (DSKG)

Michael Färber; David Lamprecht

We present the Data Set Knowledge Graph (, an RDF dataset about datasets that are linked to publications (modeled in the Microsoft Academic Knowledge Graph, MAKG) that mention the datasets. The metadata of the datasets is based on datasets that are registered in OpenAIRE and Wikidata.

What exactly do we provide?

  1. Periodically updated RDF dump files of the Data Set Knowledge Graph.
  2. URI resolution of the Data Set Knowledge Graph within the Linked Open Data.
  3. A publicly accessible SPARQL endpoint containing the latest Dataset Knowledge Graph data.

How big is the Dataset Knowledge Graph?

The Dataset Knowledge Graph models, among others,

  • 2,208 datasets from all scientific disciplines
  • 813,551 links to 634,803 unique papers
  • 1,169 authors of datasets
  • 208 ORCID IDs.

Potential use cases:

  • Use the DSKG for the development of semantic search engines (e.g. use the metadata of the linked publications of the datasets for advanced search capabilities)
  • Easier data integration by using the RDF standard vocabulary DCAT and by linking resources to other data sources (e.g., combining the DSKG with other dataset collections in RDF).
  • Data analysis to measure and award the provisioning of datasets (e.g., determine the scientific influence of datasets and authors).
Files (196.6 MB)
Name Size
103.6 MB Download
93.0 MB Download
All versions This version
Views 797797
Downloads 144144
Data volume 14.0 GB14.0 GB
Unique views 713713
Unique downloads 8383


Cite as