Aggregated global journal infrastructure dataset: metadata completeness, indexing coverage, and publication economics
Authors/Creators
Description
This dataset contains journal-level data integrating information from four public sources: the Crossref REST API, PubMed E-utilities API, the SCImago Journal Rank publicly available dataset (2024 release), and the Directory of Open Access Journals.
The original version (February 2026) covered 62,858 active Crossref journals and included six binary metadata completeness indicators (abstracts, licensing, resource links, references, author affiliations, and editorial update policies), PubMed indexing status, SCImago impact metrics and UNCTAD-based geographic classification, and DOAJ article processing charge data.
This updated version (April 1, 2026) expands the dataset in three ways. First, two additional metadata fields have been added: ORCID identifier deposition rates and funder information deposition rates, both sourced from the Crossref API. Second, for four metadata elements where Crossref reports continuous coverage rates rather than binary flags (abstracts, author affiliations, ORCID identifiers, and funder information), granular proportional coverage values (0–1) are now included alongside the binary indicators.
Files
01042026_journal_database.csv
Files
(21.5 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:38f2c469a134bc21f84f470d8a68637f
|
21.5 MB | Preview Download |
Additional details
Dates
- Updated
-
2026-04-01