Published June 8, 2026 | Version 11.1.1
Dataset Open

OpenAIRE Graph Dataset

Description

The OpenAIRE Graph is exported as several files, so you can download the parts you are interested into.

publication_[part].tar: metadata records about research literature (includes types of publications listed here)
dataset_[part].tar: metadata records about research data (includes the subtypes listed here
software.tar: metadata records about research software (includes the subtypes listed here)
otherresearchproduct_[part].tar: metadata records about research products that cannot be classified as research literature, data or software (includes types of products listed here)
organization.tar: metadata records about organizations involved in the research life-cycle, such as universities, research organizations, funders.
datasource.tar: metadata records about data sources whose content is available in the OpenAIRE Graph. They include institutional and thematic repositories, journals, aggregators, funders' databases.
project.tar: metadata records about project grants.
person.tar: metadata records about persons (researchers and other research contributors) aggregated from ORCID
communities_infrastructures.tar: metadata records about research communities and research infrastructures
[source_type]_[semantics]_[part].tar: metadata relations split by source entity type and semantics of the relation. For example product_Cites_1.tar refers to citation relationships

Each file is a tar archive containing gz files, each with one json per line. Each json is compliant to the schema available at http://doi.org/10.5281/zenodo.20559578. The documentation for the model is available at https://graph.openaire.eu/docs/data-model/

Learn more about the OpenAIRE Graph at https://graph.openaire.eu.

Discover the graph's content on OpenAIRE EXPLORE and our API for developers.

This deposition contains:

  • number of publications 218 421 450
  • number of datasets 101 778 366
  • number of software 730 346
  • number of other research products 37 832 720
  • number of datasources 163 288
  • number of projects 3 909 902
  • number of organizations 494 099
  • number of communities 37
  • number of person 14 803 875
  • number for relations:
    • product_hasAuthorInstitution relations 371 332 267 
    • product_Cites 2 348 459 637
    • product_IsSupplementTo 2 908 261
    • product_HasPart 6 431 551
    • product_Compiles 35 342
    • product_Continues 377 264
    • product_Describes 6 700
    • product_Documents 372 295
    • product_IsIdenticalTo 108 874
    • product_IsMetadataOf 854
    • product_IsRelatedTo 535 491 925
    • product_IsSourceOf 869 811 568
    • product_References 5 598 520
    • product_Requires 1 869
    • product_Reviews 47 027
    • product_HasVersion 630 645
    • product_IsNewVersionOf 787 815
    • product_IsOriginalFormOf 84 882
    • product_Obsoletes 2 814
    • project_produces 10 038 714
    • project_hasParticipant 5 550 746
    • datasource_hosts 981 889 144
    • datasource_provides 588 227 647
    • organization_provides 58 900
    • organization_IsChildOf 6 280
    • person_authorship 234 256 888
    • person_coAuthorship 372 872 930
    • person_authorAffiliation 7 435 371
    • person_projectParticipation 831 589

Notes

A new version of this dataset is published every 6 months. The content available on the OpenAIRE EXPLORE and CONNECT portals might be more up-to-date with respect to the data you find here.

Notes

This version is provided with some novelties: 

  • relatiosnhips are split by semantics of the relation with respect to the result type of the source node
  • relationships are provided only in one direction that is the one explicited by the semantcs in the name of the upload
  • NEW Person entities
  • NEW relationships insisting on the person entity:
    • authorAffiliation: employment collected from ORCID
    • projectPartecipation: link from the person and the project. These links can be obtained directly from the funder database of via propagation process (linke to the documentation)
    • authorship: relations of authorship with some attributes:
      • declaredAffiliation when a matching organization is found
      • role the contribution of the author
      • corresponding 
    • coAuthorship: co-authorship relations with the number of coAuthored products

Files

Files (378.4 GB)

Name Size Download all
md5:7b1209615a875f352fc8476279070826
37.2 kB Download
md5:6eaa2b5c2af34c47ff30e84616b40c6a
10.7 GB Download
md5:5eb853b768c9ff064fc1c5410988d855
10.7 GB Download
md5:361cd8f61f861fbc63d8449fed75184a
3.8 GB Download
md5:8e2e1a21f2fbacd5da8045bcad7ae673
15.3 MB Download
md5:0b29f1029e96ae2762e1cf3768a5b211
10.8 GB Download
md5:07c200d3ce67c5d212d7c9ba31f1a640
10.8 GB Download
md5:0db6fc992dc81d9981305c34766b937d
3.0 GB Download
md5:83e655027b173e573ad4ddbbc3ce0ac9
10.8 GB Download
md5:62f2fbd2460933d9f1efeee7eda59055
8.9 GB Download
md5:e2543fff41bb438c1150a634f3ec7462
41.8 MB Download
md5:f8460a362f08cf5ad44ce0b78db61c81
703.0 kB Download
md5:65f30ed7e9768b0e6bd0ab18520e0f91
2.9 MB Download
md5:923976b23b917bcca2d54aa0f2082dcf
10.7 GB Download
md5:a5602598cf79693ad62ee600e5210d98
2.0 GB Download
md5:4323249b38e97b0f81564fc7ad5931df
1.1 GB Download
md5:f6bb6de2e13dc0fc8f0a25c1d7f7f916
185.3 MB Download
md5:4fdf9ffc3efd8605f6e1cedbf20ac20c
10.7 GB Download
md5:bb573ecfe23abc30c900563aebb69d57
1.5 GB Download
md5:a74836356fe84bcaf990463c1a0f9c0b
8.6 GB Download
md5:4cdbff4add24cc61161e0bd43edb6c66
22.5 MB Download
md5:f0e566965351e3ea7ad3def604e5b771
10.8 GB Download
md5:cb169b6fb3c04354d7053ec454f6e80f
10.8 GB Download
md5:596e490f91371c972cc496e0202f9418
10.8 GB Download
md5:6b2726932b0ffaf494cb9627bc2b899c
10.8 GB Download
md5:3ffad2d9f0b9c4a30ca5588ba4278009
10.8 GB Download
md5:17bad84892e84ab353fa553f1d09f97d
10.8 GB Download
md5:1bc6f5a3b805eba5ba4b24720302f397
10.8 GB Download
md5:00d3785c08bdee6d4ae0b7c30e53806a
76.8 MB Download
md5:75ad2f294ec176ef71ff052ef8f28a07
1.6 MB Download
md5:06bc77f8eaf91efe6afc2acd0205c439
16.9 MB Download
md5:040cc34c5ae693c8381d172d56fd3498
871.9 kB Download
md5:1f4222fdd37f8c2ccab2ca77007b7199
12.7 MB Download
md5:073980928ee0b47e49626e23957e6192
8.8 GB Download
md5:1dbe09344050ccab747aac4cbf8a59fa
272.4 MB Download
md5:cf82d33b7693c73810688df505de6384
28.5 MB Download
md5:dbb0596d65f7d61fddfda76c82742d68
5.3 MB Download
md5:8772f08468a4e072921ffac50d739533
408.6 kB Download
md5:e7c327381439736f2cebcb6afec5eb45
34.2 MB Download
md5:10661ed133325e2fabeea399ab9486ec
3.8 MB Download
md5:ac8d74b7fc8c08b008d7ea0a2248d7b9
9.7 GB Download
md5:ef5423a20a1d90177e6e504287459b19
10.7 GB Download
md5:bf175465a282a73dd2f297d3cf759af2
10.5 GB Download
md5:befbb4e1ab6f52fbe0cbeb327b26acaf
122.3 MB Download
md5:f28532f41e507c15064b6a825151ad6e
623.6 kB Download
md5:2846dca517db8cb385958f6afa39e95e
229.3 MB Download
md5:20b913ea925e2ae8ec3bffd9868344bd
560.1 kB Download
md5:4fdccb88361588c94de7865c6065ad10
2.6 MB Download
md5:0c278277d6992030f43e9af1fdf47c8e
722.6 MB Download
md5:478ec8660b36e97b6f3087ba3009b4a0
158.7 MB Download
md5:f07d4d62eb8ba364d9ed79cbb483b2fa
432.5 MB Download
md5:42f66ef3bf486478e92e0695bc383367
10.7 GB Download
md5:c5bff4785c156cfc4bc92e7fbd987040
10.7 GB Download
md5:87436ae475381c28595edc4f6e106125
10.7 GB Download
md5:9e1c45e831061ab448b185b7d9218ee0
10.7 GB Download
md5:902450ac23de07095eb413a4fc130fcd
10.7 GB Download
md5:b107c154cdeffa4db3ac978a8a1af82c
10.7 GB Download
md5:3d0b621fec7f6685572d8e4298fca4f4
5.9 GB Download
md5:2e291e60b8f6f8fdfe84f97e2fd76e1f
10.7 GB Download
md5:0665e8232fb75a392c7b983c697280bf
10.7 GB Download
md5:a37ef3bdd910e0b3ddc5883c87e16de1
10.7 GB Download
md5:03b52586ab37373aaffcd4f8bd566781
10.7 GB Download
md5:6074c535bca7f0f159aebf311a1659d7
10.7 GB Download
md5:a7a16b3e7772031e451e34eaf4b87570
10.7 GB Download
md5:0bbd88a440e098a1f6e7923dec66877f
10.7 GB Download
md5:eb261b2990365cd441689257f15482cb
10.7 GB Download
md5:160b7de108c6b3433dd86e3aa7bc5c2a
258.2 MB Download

Additional details

Related works

Has metadata
Other: 10.5281/zenodo.20559578 (DOI)

Funding

European Commission
SciLake - Democratising and making sense out of heterogeneous scholarly content 101058573
European Commission
GraspOS - GraspOS: next Generation Research Assessment to Promote Open Science 101095129
European Commission
FAIRCORE4EOSC - Core Components Supporting a FAIR EOSC 101057264
European Commission
OSTrails - Open Science Plan-Track-Assess Pathways 101130187
European Commission
EOSC Beyond - EOSC Beyond: advancing innovation and collaboration for research 101131875