Published September 4, 2023 | Version 1
Dataset Open

Uncovering the Citation Landscape: Exploring OpenCitations COCI, OpenCitations Meta, and ERIH-PLUS in Social Sciences and Humanities Journals - DATA PREPROCESSED

  • 1. University of Bologna

Description

This zipped folders contain all the data preprocessed for the research "Uncovering the Citation Landscape: Exploring OpenCitations COCI, OpenCitations Meta, and ERIH-PLUS in Social Sciences and Humanities Journals": the cleaned datasets (coci_preprocessed, meta_preprocessed, erih_preprocessed and erih_meta).

  • coci_preprocessed.zip: this archive contains CSVs with two columns “citing” and “cited”, giving information about publications involved in citations according to the OpenCitations COCI dataset (version 19 released on January 2023), and that are entirely contained in OpenCitations META (version 3 released on February 2023). This means that the citations which have either the citing or the cited entity (or both) not contained in META are excluded from coci_preprocessed dataset.

  • meta_preprocessed.zip: all the original columns of OpenCitations META are maintained in this dataset, so the CSVs have the columns: “id”, “title”, “author”, “issue”, “volume”, “venue”, “page”, “pub_date”, “type”, “publisher” and “editor”. The only difference with the original dataset is that meta_preprocessed in the columns “id” and “venue” has respectively just the DOIs and the ISSNs, without all the other identifiers specified for each entity in META.

  • erih_preprocessed.zip: it contains a  CSV file with two columns "venue_id" and "ERIH_disciplines". "venue_id" is the union of the original columns "Online ISSN" and "Print ISSN" of ERIH_PLUS (version downloaded on 2023-04-27).

  • erih_meta.zip: it contains CSV files obtained from the union of meta_preprocessed and erih_preprocessed, they have all the columns of meta_preprocessed plus a new column “erih_disciplines” containing all the disciplines linked to a venue (identified by an ISSN).

 

Software: https://doi.org/10.5281/zenodo.8326023

Data produced: https://doi.org/10.5281/zenodo.7974816

Article: https://zenodo.org/record/8326044

DMP: https://zenodo.org/record/8324973

Protocol: https://doi.org/10.17504/protocols.io.n92ldpeenl5b/v5

Files

1coci_preprocessed.zip

Files (32.3 GB)

Name Size Download all
md5:08e8bc63be0910a416478cc8708467d6
15.4 GB Preview Download
md5:5fc3fa5d64f1c9f5d0fa398d735b6fc9
8.6 GB Preview Download
md5:18a19a107e262dcb2e55de062d73f3b7
158.8 kB Preview Download
md5:6fa8645cad553ce1ffa64d3d3ff94dc2
8.3 GB Preview Download