Published August 18, 2022 | Version 1
Dataset Open

Experimental AI corpus from OpenAlex

  • 1. Nesta

Description

A corpus of AI research from OpenAlex. Includes:

  • A works table with metadata about AI papers
  • An authors table with information about the authors
  • An institutions table with information about institutions
  • A concepts table with information about concepts in works
  • A MeSH table with information about MeSH terms in works
  • A concepts json with the OpenAlex concept taxonomy
  • An abstracts json with deinverted abstracts
  • A citations json with citations from papers

See `ai_openalex_description.md` for data dictionaries.

See `ai_openalex_methodology.md` for a description of the method used to create the dataset.

See here for additional information: https://github.com/nestauk/ai_genomics

Files

ai_abstracts.json

Files (4.2 GB)

Name Size Download all
md5:087f2188920bed123e8052d19622e855
1.2 GB Preview Download
md5:f0879b9d16d4b5f6679f3316f021cb33
621.3 MB Preview Download
md5:bb1261baa1dfd7ffc9f27b78b95955b5
739.1 MB Preview Download
md5:4f41861245e897561ccf3ff4f8aa98a4
936.3 MB Preview Download
md5:d1d1d6c7561a4756bcccc604b9c718f7
2.2 MB Preview Download
md5:6c173113371b66e1f0c58bd16ad00b45
35.8 MB Preview Download
md5:96bcd037db88afb977551c60b9f52d1e
3.5 kB Preview Download
md5:7fb4aacd57c39b20dec2a9ca200facb4
8.7 kB Preview Download
md5:078b68d8f16a138ddf1a3352bbae298d
314.5 MB Preview Download
md5:b9d1ff921057a4976e3db585b4f175ac
417.6 MB Preview Download