Published December 1, 2023 | Version v1
Dataset Restricted

Binary Field of Research Classification Data

Description

The zip file includes the following data:

  1. Class category list (for the categorical baseline).
  2. Document text: the title + abstract of each dataset instance.
  3. Class ORKG texts only: the dataset's ORKG label for each instance.
  4. Class DBpedia texts only: texts extracted from the linked DBpedia entity for each instance (derived from rdfs:label + rdfs:comment).
  5. Class KGEs: DBpedia entity embeddings for each instance. The embeddings are pre-trained and were extracted from https://zenodo.org/records/6384728.
  6. Author embedding for each instance.
  7. Publisher embedding for each instance.
  8. Labels: binary labels indicating whether an instance is a positive or negative sample.
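
As a rough illustration of how the components listed above might fit together for a single instance, here is a minimal Python sketch. The field names, the 1/0 label encoding, the space used to join rdfs:label and rdfs:comment, and the toy embedding values are all assumptions made for illustration; the actual file names and formats inside the zip archive are not described in this record.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Instance:
    """One dataset instance. Field names are hypothetical, not taken from the archive."""
    document_text: str          # title + abstract
    orkg_text: str              # ORKG label of the class
    dbpedia_text: str           # text from the linked DBpedia entity
    class_kge: List[float]      # pre-trained DBpedia entity embedding
    author_emb: List[float]     # author embedding
    publisher_emb: List[float]  # publisher embedding
    label: int                  # assumed encoding: 1 = positive, 0 = negative


def join_dbpedia_text(rdfs_label: str, rdfs_comment: str) -> str:
    # Assumed joining rule: label, a single space, then comment.
    return f"{rdfs_label.strip()} {rdfs_comment.strip()}".strip()


def dense_features(inst: Instance) -> List[float]:
    # Concatenate the three embeddings into one flat feature vector.
    return inst.class_kge + inst.author_emb + inst.publisher_emb


# Toy values only; real embeddings would have far more dimensions.
example = Instance(
    document_text="A title. An abstract.",
    orkg_text="Physics",
    dbpedia_text=join_dbpedia_text("Physics", " The study of matter. "),
    class_kge=[0.1, 0.2],
    author_emb=[0.3],
    publisher_emb=[0.4, 0.5],
    label=1,
)
features = dense_features(example)
```

Keeping the text fields separate from the dense vector leaves open whether a classifier would use the raw texts, the embeddings, or both.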

Files

Restricted

The record is publicly accessible, but files are restricted to users with access.