Published January 19, 2021 | Version v1
Dataset Open

Codes and Datasets for: 'The Largest Academic Publishers of Scholarly Journals: A Webscraping Approach'

Authors/Creators

  • 1. TU Wien

Description

This set comprises:

  • "count_publishers*": four R scripts that draw on DOAJ, Publons, Scopus and SherpaRomeo to extract scholarly publishers and the number of journals assigned to each publisher;
  • "data*": two underlying data samples (from DOAJ and Scopus); the other two sources are accessed via web scraping;
  • "harmonize*": one text file and one R script for harmonizing publisher names;
  • "alljournals.xlsx": the resulting list of scholarly publishers, ordered by the number of journals assigned to them, from highest to lowest.
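
The workflow described in this list (harmonize publisher names, then count the journals per publisher and rank the result) can be sketched as follows. This is a minimal Python illustration, not the R code shipped in this record: the record fields ("title", "publisher"), the sample journals, and the harmonization table are all hypothetical and do not reflect the actual layout of the DOAJ or Scopus samples.

```python
from collections import Counter

# Hypothetical journal records; the field names are assumptions for
# illustration, not the real structure of the data samples.
journals = [
    {"title": "Journal A", "publisher": "Example Press Ltd."},
    {"title": "Journal B", "publisher": "example  press"},
    {"title": "Journal C", "publisher": "Sample University"},
]

# Hypothetical harmonization table: normalized variant -> canonical name.
harmonize = {
    "example press ltd.": "Example Press",
    "example press": "Example Press",
}


def canonical_name(raw):
    """Collapse whitespace, then map known variants to one canonical name."""
    cleaned = " ".join(raw.split())
    return harmonize.get(cleaned.lower(), cleaned)


def count_publishers(records):
    """Return (publisher, journal_count) pairs, largest count first."""
    counts = Counter(canonical_name(r["publisher"]) for r in records)
    return counts.most_common()


ranking = count_publishers(journals)
for publisher, n_journals in ranking:
    print(publisher, n_journals)
```

Without the harmonization step, the two spellings of the same publisher would be counted separately, which is presumably why the record ships a dedicated harmonization file alongside the counting scripts.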

Files

data--doaj.json

Files (59.1 MB)

Checksum  Size
md5:0d1a3040a1e80e5e1af291a219e423d6  30.8 kB
md5:fecf78e4aece1d8a05da3494567a3c4d  535 Bytes
md5:ea6ac81101d530eb5349af4b2e28ade1  1.2 kB
md5:9a24f4dddea8d07e18985a1ca00df8eb  391 Bytes
md5:1e5667d51b0ab8869a1854a6644094cc  761 Bytes
md5:aba889ee35659aa67fcc807526882cef  33.3 MB
md5:52be1c499b002500e825b8d57d99d320  25.7 MB
md5:b93289d05d6c04072df255bb8d896ff5  2.7 kB
md5:9bee454812f0bf77ff44e3821d452acf  3.1 kB
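
A downloaded file can be checked against the md5 values in this listing. The sketch below is a hedged Python example using only the standard library; the function names are hypothetical, and which checksum belongs to which file has to be taken from the record itself, since the listing here does not pair them with file names.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the md5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, listed):
    """Compare a file against a listed value such as 'md5:0d1a30...'."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

Reading in chunks keeps memory use low for the two larger files (33.3 MB and 25.7 MB).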