Published March 11, 2022 | Version v1
Dataset Open

A dataset of scholarly journals in Wikidata: (selected) external identifiers

  • 1. European Research Council Executive Agency

Description

 

For an updated list, see:

Matching OpenAlex venues to Wikidata identifiers

Motivation: the selective/inclusive approach in bibliometric databases

An important difference between bibliometric databases is their “inclusion policy”.

Some databases, like Web of Science and Scopus, select the sources they index, while others, like Dimensions and OpenAlex, are more inclusive (for example, they index all data from a given source such as Crossref).

WOS

“selectivity remained a hallmark of coverage because Garfield had decided early on to focus on internationally influential journals. (...)”

SCOPUS 

Serial content (i.e., journals, conference proceedings, and book series) submitted for possible inclusion in Scopus by editors and publishers is reviewed and selected, based on criteria of scientific quality and rigor. This selection process is carried out by an external Content Selection and Advisory Board (CSAB) of editorially independent scientists, each of which are subject matter experts in their respective fields. This ensures that only high-quality curated content is indexed in the database and affirms the trustworthiness of Scopus.

 

Dimensions 

We have decided to take an “inclusive” approach to the publications we index in Dimensions. We believe that Dimensions should be a comprehensive data source, not a judgment call, and so we index as broad a swath of content as possible and have developed a number of features (e.g., the Dimensions API, journal list filters that limit search results to journals that appear in sources such as Pubmed or the 2015 Australian ERA6 journal list) that allow users to filter and select the data that is most relevant to their specific needs.



Using Wikidata to enable the filtering of “venue subsets” in OpenAlex

 

We are interested in creating subsets of venues in OpenAlex (for example, for comparative analysis with inclusive databases, or for other use cases). This requires matching the identifiers of OpenAlex venues to other identifiers.

Thanks to WikiCite, a project to record and link scholarly data, Wikidata has a large collection of metadata related to scholarly journals. This repository provides a subset of the scholarly journals in Wikidata, focusing mainly on external identifiers.

The dataset will be used to explore the extent to which the external identifiers of journals in Wikidata can be used to select content in OpenAlex.

(See here a list of openly available lists of journals.)
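The matching idea described above can be illustrated with a minimal sketch. It assumes that both sides are available as lists of dictionaries and that ISSN-L is used as the join key; the field names `id`, `issn_l`, `qid` and `ext_id_issn_l` are illustrative assumptions, not the confirmed layout of the dataset files or of an OpenAlex export.

```python
from typing import Dict, Iterable, List, Optional


def normalize_issn(issn: Optional[str]) -> Optional[str]:
    """Return an ISSN as 'NNNN-NNNC' (upper-case check character), or None if malformed."""
    if not issn:
        return None
    compact = issn.replace("-", "").strip().upper()
    well_formed = (
        len(compact) == 8
        and compact[:7].isdigit()
        and (compact[7].isdigit() or compact[7] == "X")
    )
    return f"{compact[:4]}-{compact[4:]}" if well_formed else None


def match_venues(
    openalex_venues: Iterable[Dict[str, Optional[str]]],
    wikidata_journals: Iterable[Dict[str, Optional[str]]],
) -> List[Dict[str, str]]:
    """Pair OpenAlex venues with Wikidata journals that share the same ISSN-L."""
    # Index the Wikidata side by normalized ISSN-L; the first journal seen wins.
    by_issn_l: Dict[str, Dict[str, Optional[str]]] = {}
    for journal in wikidata_journals:
        key = normalize_issn(journal.get("ext_id_issn_l"))
        if key is not None:
            by_issn_l.setdefault(key, journal)

    matches: List[Dict[str, str]] = []
    for venue in openalex_venues:
        key = normalize_issn(venue.get("issn_l"))
        journal = by_issn_l.get(key) if key is not None else None
        if journal is not None:
            matches.append(
                {"openalex_id": venue["id"], "wikidata_qid": journal["qid"], "issn_l": key}
            )
    return matches
```

Once a venue is paired with a Wikidata item, any other identifier recorded on that item (for example a Scopus source ID) can serve as a filter criterion. Venues without an ISSN-L are left unmatched in this sketch and would need another key.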

Dataset creation & documentation

  • Wikidata dump from 2022-02-21

  • Extract entities belonging to the following classes:

    • https://www.wikidata.org/wiki/Q5633421    #  scientific journal (Q5633421)

    • https://www.wikidata.org/wiki/Q737498     #  academic journal (Q737498)

  • Extract the properties related to (selected) external identifiers 
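The steps above can be sketched roughly as follows. This is not the author's extraction script: it assumes the standard Wikidata JSON dump layout (one entity per line inside a JSON array) and assumes that journals are selected through direct "instance of" (P31) statements. The column names come from the annotations in the identifier list of this description, and the `qid` key is an illustrative choice.

```python
import json
from typing import Dict, Iterable, Iterator, List

# Classes named in this description: scientific journal, academic journal.
JOURNAL_CLASSES = {"Q5633421", "Q737498"}

# Wikidata property -> output column (names taken from the annotated list).
IDENTIFIER_PROPERTIES = {
    "P236": "ext_id_issn",
    "P7363": "ext_id_issn_l",
    "P8375": "ext_id_crossref_journal_id",
    "P1055": "ext_id_nlm_unique_id",
    "P1058": "ext_id_era_journal_id",
    "P1250": "ext_id_danish_bif_id",
    "P10283": "ext_id_openalex_id",
    "P1156": "ext_id_scopus_source_id",
}


def claim_values(entity: dict, prop: str) -> List[str]:
    """Collect the main values of all statements for one property."""
    values: List[str] = []
    for statement in entity.get("claims", {}).get(prop, []):
        datavalue = statement.get("mainsnak", {}).get("datavalue")
        if datavalue is None:  # "novalue" / "somevalue" snaks carry no datavalue
            continue
        value = datavalue.get("value")
        if isinstance(value, dict):  # item-valued claim: keep the QID
            value = value.get("id")
        if isinstance(value, str):  # external identifiers are plain strings
            values.append(value)
    return values


def extract_journals(lines: Iterable[str]) -> Iterator[Dict[str, object]]:
    """Yield one row per journal entity found in the dump lines."""
    for line in lines:
        line = line.strip().rstrip(",")
        if line in ("", "[", "]"):  # array brackets and blank lines
            continue
        entity = json.loads(line)
        if not JOURNAL_CLASSES.intersection(claim_values(entity, "P31")):
            continue
        row: Dict[str, object] = {"qid": entity["id"]}
        for prop, column in IDENTIFIER_PROPERTIES.items():
            row[column] = claim_values(entity, prop)
        yield row
```

A compressed dump can be streamed by passing a text-mode file handle, for example the object returned by `bz2.open(path, "rt", encoding="utf-8")`, so the full dump never has to fit in memory. Each column holds a list because a journal may carry several values for the same property.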

 

Some numbers:

  • Journals in Wikidata: 113,797

  • With an ISSN-L (issn_l): 95,888

  • With an OpenAlex venue ID: 29,150
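To put these counts in proportion, the coverage shares can be computed directly from the three figures reported here; the variable and function names are illustrative.

```python
TOTAL_JOURNALS = 113_797    # journals in Wikidata (2022-02-21 dump)
WITH_ISSN_L = 95_888        # journals with an ISSN-L
WITH_OPENALEX_ID = 29_150   # journals with an OpenAlex venue ID


def share(part: int, total: int) -> float:
    """Percentage of `total` represented by `part`, rounded to one decimal."""
    return round(100 * part / total, 1)


print(f"ISSN-L coverage: {share(WITH_ISSN_L, TOTAL_JOURNALS)}%")            # 84.3%
print(f"OpenAlex ID coverage: {share(WITH_OPENALEX_ID, TOTAL_JOURNALS)}%")  # 25.6%
```

Roughly five in six journals carry an ISSN-L, while about one in four already carries an OpenAlex venue ID.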

External identifiers

https://www.wikidata.org/wiki/Property:P236  # ext_id_issn

https://www.wikidata.org/wiki/Property:P7363  # ext_id_issn_l

https://www.wikidata.org/wiki/Property:P8375 # ext_id_crossref_journal_id

https://www.wikidata.org/wiki/Property:P1055 # ext_id_nlm_unique_id

https://www.wikidata.org/wiki/Property:P1058 # ext_id_era_journal_id

https://www.wikidata.org/wiki/Property:P1250 # ext_id_danish_bif_id

https://www.wikidata.org/wiki/Property:P10283 # ext_id_openalex_id

https://www.wikidata.org/wiki/Property:P1156  # ext_id_scopus_source_id


Indexing services

https://www.wikidata.org/wiki/Property:P8875 # indexed in bibliographic review

https://www.wikidata.org/wiki/Q371467 #    Scopus

https://www.wikidata.org/wiki/Q104047209 # Science Citation Index Expanded

https://www.wikidata.org/wiki/Q22908122 #  Emerging Sources Citation Index

https://www.wikidata.org/wiki/Q1090953 #   Social Sciences Citation Index

https://www.wikidata.org/wiki/Q713927 #    Arts and Humanities Citation Index
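As an illustration of how the indexing-service values listed above could be used to build a subset, here is a minimal sketch. It assumes that each journal row stores the item IDs found under P8875 in a list; the `indexed_in` and `qid` field names are hypothetical and may not match the column names in the released files.

```python
from typing import Dict, Iterable, List

# Indexing services listed in this description, keyed by Wikidata item ID.
INDEXING_SERVICES: Dict[str, str] = {
    "Q371467": "Scopus",
    "Q104047209": "Science Citation Index Expanded",
    "Q22908122": "Emerging Sources Citation Index",
    "Q1090953": "Social Sciences Citation Index",
    "Q713927": "Arts and Humanities Citation Index",
}


def indexing_labels(journal: Dict[str, object]) -> List[str]:
    """Readable names of the listed services that index this journal."""
    return [
        INDEXING_SERVICES[qid]
        for qid in journal.get("indexed_in", [])
        if qid in INDEXING_SERVICES
    ]


def select_indexed(
    journals: Iterable[Dict[str, object]], service_qids: Iterable[str]
) -> List[Dict[str, object]]:
    """Keep journals indexed by at least one of the given services."""
    wanted = set(service_qids)
    return [j for j in journals if wanted.intersection(j.get("indexed_in", []))]
```

For example, `select_indexed(rows, ["Q371467"])` would keep only the journals that Wikidata records as indexed in Scopus. The result depends on how completely these statements are curated in Wikidata.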

  

 

 

 

Notes

Disclaimer: The views expressed in this paper are the author's. They do not reflect the views or official positions of the ERC, its Scientific Council or the European Commission.

Files

wikidata_journal_identifiers.zip (8.6 MB)

md5:4f114c6a72936795244eedc038eb6a4d