Published April 11, 2025 | Version 5.10.0
Dataset Open

CSET scholarly literature metadata over OpenAlex works

  • 1. Center for Security and Emerging Technology

Description

This dataset contains metadata developed at the Center for Security and Emerging Technology that augments OpenAlex works, including outputs from CSET-developed classifiers. Detailed documentation is available here.

The attached zip file contains a set of JSONL files which comprise our dataset. Each row conforms to this schema, with null values omitted. This dataset is currently a work in progress and full documentation will be made available at a later date.

Research subject classifications are based on work supported in part by the Alfred P. Sloan Foundation under Grant No. G-2023-22358.

Files

cset_openalex.zip

Files (2.0 GB)

Name Size Download all
md5:e2b5a2f831e7bc8735d9154f5e348614
2.0 GB Preview Download

Additional details

Related works

Documents
10.51593/20220030 (DOI)
10.48550/arXiv.2002.07143 (DOI)
Is referenced by
Publication: 10.48550/arXiv.2403.09097 (DOI)

Software

Repository URL
https://github.com/georgetown-cset/cset_openalex
Programming language
Python
Development Status
Active