A dataset of scholarly journals in Wikidata: (selected) external identifiers
Description
For an updated list, see: Matching OpenAlex venues to Wikidata identifiers
Motivation: the selective/inclusive approach in bibliometric databases
An important difference between bibliometric databases is their “inclusion policy”.
Some databases, like Web of Science and Scopus, select the sources they index, while others, like Dimensions and OpenAlex, are more inclusive (for example, they index all data from a given source such as Crossref).
“selectivity remained a hallmark of coverage because Garfield had decided early on to focus on internationally influential journals.” (...)
“Serial content (i.e., journals, conference proceedings, and book series) submitted for possible inclusion in Scopus by editors and publishers is reviewed and selected, based on criteria of scientific quality and rigor. This selection process is carried out by an external Content Selection and Advisory Board (CSAB) of editorially independent scientists, each of which are subject matter experts in their respective fields. This ensures that only high-quality curated content is indexed in the database and affirms the trustworthiness of Scopus”
“We have decided to take an “inclusive” approach to the publications we index in Dimensions. We believe that Dimensions should be a comprehensive data source, not a judgment call, and so we index as broad a swath of content as possible and have developed a number of features (e.g., the Dimensions API, journal list filters that limit search results to journals that appear in sources such as Pubmed or the 2015 Australian ERA journal list) that allow users to filter and select the data that is most relevant to their specific needs.”
Using Wikidata to enable the filtering of “venue subsets” in OpenAlex
We are interested in creating subsets of venues in OpenAlex (for example for comparative analysis with inclusive databases or other use cases). This would require matching identifiers of OpenAlex venues to other identifiers.
Thanks to WikiCite, a project to record and link scholarly data, Wikidata has a large collection of metadata related to scholarly journals. This repository provides a subset of the scholarly journals in Wikidata, focusing mainly on external identifiers.
The dataset will be used to explore the extent to which Wikidata journal external identifiers can be used to select content in OpenAlex.
(See here a list of openly available lists of journals.)
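The matching idea can be sketched as a simple lookup on a shared identifier. This is a minimal illustration only: it assumes both sides expose an ISSN-L, and the record layout, field names (`id`, `qid`, `issn_l`, `scopus_source_id`) and toy values are hypothetical rather than the actual schema of the dataset or of OpenAlex.

```python
def match_venues_by_issn_l(openalex_venues, wikidata_journals):
    """Attach Wikidata identifiers to OpenAlex venues that share an ISSN-L."""
    # Index the Wikidata journals by ISSN-L so each venue is a dictionary lookup.
    by_issn_l = {}
    for journal in wikidata_journals:
        issn_l = journal.get("issn_l")
        if issn_l:
            by_issn_l.setdefault(issn_l, []).append(journal)

    matches = []
    for venue in openalex_venues:
        # Venues without an ISSN-L, or with an unknown one, produce no match.
        for journal in by_issn_l.get(venue.get("issn_l"), []):
            matches.append({"openalex_id": venue["id"], **journal})
    return matches


# Hypothetical toy records, for illustration only.
venues = [
    {"id": "V1", "issn_l": "1234-5678"},
    {"id": "V2", "issn_l": "0000-0000"},
]
journals = [
    {"qid": "Q1", "issn_l": "1234-5678", "scopus_source_id": "111"},
]
matched = match_venues_by_issn_l(venues, journals)
```

Once venues carry the extra identifiers, a subset can be selected by keeping only the matches that have, for instance, a non-empty Scopus source ID.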
Dataset creation & Documentation
- Wikidata dump from 2022-02-21
- Extract entities with the following properties:
  - https://www.wikidata.org/wiki/Q5633421 # scientific journal (Q5633421)
  - https://www.wikidata.org/wiki/Q737498 # academic journal (Q737498)
- Extract the properties related to (selected) external identifiers
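The extraction steps above could look roughly like the following sketch. It assumes the standard Wikidata JSON dump layout (one entity per line inside a JSON array) and, importantly, assumes that journals were selected through the `P31` (instance of) property; the description does not state which property links an entity to the two journal items. Function names and the toy entities are hypothetical.

```python
import json

# The two items named in the list above.
JOURNAL_CLASSES = {"Q5633421", "Q737498"}  # scientific journal, academic journal


def instance_of_ids(entity):
    """Return the item IDs an entity points to through P31 (assumed selector)."""
    ids = set()
    for statement in entity.get("claims", {}).get("P31", []):
        value = statement.get("mainsnak", {}).get("datavalue", {}).get("value", {})
        if isinstance(value, dict) and "id" in value:
            ids.add(value["id"])
    return ids


def iter_journals(lines):
    """Yield journal entities from the lines of a Wikidata JSON dump."""
    for line in lines:
        # Dump lines end with a comma; the first and last lines are brackets.
        line = line.strip().rstrip(",")
        if line in ("[", "]", ""):
            continue
        entity = json.loads(line)
        if instance_of_ids(entity) & JOURNAL_CLASSES:
            yield entity


def toy_entity(qid, class_id):
    """Build a hypothetical, heavily trimmed entity for illustration."""
    return {
        "id": qid,
        "claims": {"P31": [{"mainsnak": {"datavalue": {"value": {"id": class_id}}}}]},
    }


sample_lines = [
    "[",
    json.dumps(toy_entity("Q_A", "Q5633421")) + ",",
    json.dumps(toy_entity("Q_B", "Q5")),
    "]",
]
journal_ids = [entity["id"] for entity in iter_journals(sample_lines)]
```

For a real dump, `lines` would typically be a file object opened in text mode on the compressed dump, so that entities are streamed one at a time instead of loaded into memory.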
Some numbers:
- Number of journals in Wikidata: 113,797
- With issn_l: 95,888
- With OpenAlex venue ID: 29,150
External identifiers
https://www.wikidata.org/wiki/Property:P236 # ext_id_issn
https://www.wikidata.org/wiki/Property:P7363 # ext_id_issn_l
https://www.wikidata.org/wiki/Property:P8375 # ext_id_crossref_journal_id
https://www.wikidata.org/wiki/Property:P1055 # ext_id_nlm_unique_id
https://www.wikidata.org/wiki/Property:P1058 # ext_id_era_journal_id
https://www.wikidata.org/wiki/Property:P1250 # ext_id_danish_bif_id
https://www.wikidata.org/wiki/Property:P10283 # ext_id_openalex_id
https://www.wikidata.org/wiki/Property:P1156 # ext_id_scopus_source_id
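As an illustration of how the properties listed above might be turned into columns, the sketch below maps each property ID to the `ext_id_*` name shown next to it and collects the string values found on an entity. It assumes the standard Wikidata entity JSON layout; the function name, the `qid` key, and the toy entity are hypothetical, and the actual files may be organized differently.

```python
# Property ID -> column name, taken from the list above.
EXTERNAL_ID_PROPERTIES = {
    "P236": "ext_id_issn",
    "P7363": "ext_id_issn_l",
    "P8375": "ext_id_crossref_journal_id",
    "P1055": "ext_id_nlm_unique_id",
    "P1058": "ext_id_era_journal_id",
    "P1250": "ext_id_danish_bif_id",
    "P10283": "ext_id_openalex_id",
    "P1156": "ext_id_scopus_source_id",
}


def extract_external_ids(entity):
    """Return one row per entity, with a list of values for each identifier."""
    row = {"qid": entity.get("id")}
    claims = entity.get("claims", {})
    for prop, column in EXTERNAL_ID_PROPERTIES.items():
        values = []
        for statement in claims.get(prop, []):
            value = statement.get("mainsnak", {}).get("datavalue", {}).get("value")
            # External identifiers are stored as plain strings.
            if isinstance(value, str):
                values.append(value)
        row[column] = values
    return row


# Hypothetical entity with two ISSNs and one ISSN-L, for illustration only.
toy = {
    "id": "Q_A",
    "claims": {
        "P236": [
            {"mainsnak": {"datavalue": {"value": "1234-5678"}}},
            {"mainsnak": {"datavalue": {"value": "8765-4321"}}},
        ],
        "P7363": [{"mainsnak": {"datavalue": {"value": "1234-5678"}}}],
    },
}
row = extract_external_ids(toy)
```

Values are kept as lists because a journal can carry several identifiers of the same kind, for example a print and an electronic ISSN.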
Indexing services
https://www.wikidata.org/wiki/Property:P8875 # indexed in bibliographic review
https://www.wikidata.org/wiki/Q371467 # Scopus
https://www.wikidata.org/wiki/Q104047209 # Science Citation Index Expanded
https://www.wikidata.org/wiki/Q22908122 # Emerging Sources Citation Index
https://www.wikidata.org/wiki/Q1090953 # Social Sciences Citation Index
https://www.wikidata.org/wiki/Q713927 # Arts and Humanities Citation Index
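A minimal sketch of how the indexing information could be read from an entity, assuming the services above appear as item values of property `P8875` in the standard Wikidata entity JSON layout. The function name and the toy entity are hypothetical.

```python
# Item ID -> service name, taken from the list above.
INDEXING_SERVICES = {
    "Q371467": "Scopus",
    "Q104047209": "Science Citation Index Expanded",
    "Q22908122": "Emerging Sources Citation Index",
    "Q1090953": "Social Sciences Citation Index",
    "Q713927": "Arts and Humanities Citation Index",
}


def indexed_in(entity):
    """Return the names of the listed services found in an entity's P8875 claims."""
    names = []
    for statement in entity.get("claims", {}).get("P8875", []):
        value = statement.get("mainsnak", {}).get("datavalue", {}).get("value", {})
        # Keep only the services of interest; other values are ignored.
        if isinstance(value, dict) and value.get("id") in INDEXING_SERVICES:
            names.append(INDEXING_SERVICES[value["id"]])
    return names


# Hypothetical entity indexed in Scopus and in one unlisted service.
toy = {
    "id": "Q_A",
    "claims": {
        "P8875": [
            {"mainsnak": {"datavalue": {"value": {"id": "Q371467"}}}},
            {"mainsnak": {"datavalue": {"value": {"id": "Q_OTHER"}}}},
        ]
    },
}
services = indexed_in(toy)
```

A selective subset could then be defined as the journals for which this list is non-empty, or for which it contains a particular service.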
Notes
Files
wikidata_journal_identifiers.zip (8.6 MB)
md5:4f114c6a72936795244eedc038eb6a4d