There is a newer version of the record available.

Published September 30, 2022 | Version 1.0
Dataset Open

Horizon projects network

  • 1. Area Science Park

Description

Horizon 2020 projects are a major initiative to foster collaborative research and innovation at the European level. Results and metadata are available as open data at https://data.europa.eu/data/datasets/cordish2020projects and can be explored at https://cordis.europa.eu/datalab/.

The aim of this project is to create a dataset that can be used to analyse collaborations among European organisations (companies, academia, research centres...), with hierarchical geographical aggregation (NUTS levels 1, 2 and 3) as well as aggregation by topic (EuroSciVoc) and by Horizon 2020 pillar.
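NUTS codes are nested by prefix: a two-letter country code followed by one character per level, so a NUTS 3 code already contains its NUTS 2 and NUTS 1 parents. The sketch below illustrates how hierarchical aggregation could rely on this, assuming region codes are stored as plain five-character strings. The function name `nuts_levels` and the sample code are illustrative and are not taken from the dataset.

```python
from collections import Counter


def nuts_levels(nuts3_code: str) -> dict:
    """Split a NUTS 3 code into country, NUTS 1, NUTS 2 and NUTS 3 parts."""
    code = nuts3_code.strip().upper()
    if len(code) != 5:
        raise ValueError(f"expected a 5-character NUTS 3 code, got {code!r}")
    return {
        "country": code[:2],  # two-letter country code
        "nuts1": code[:3],    # country + 1 character
        "nuts2": code[:4],    # country + 2 characters
        "nuts3": code,        # country + 3 characters
    }


# Hypothetical list of NUTS 3 codes, one per organisation.
codes = ["ITH44", "ITH42", "ITC4C", "ITH44"]

# Count organisations at NUTS 2 level by truncating each code.
per_nuts2 = Counter(nuts_levels(code)["nuts2"] for code in codes)
print(per_nuts2)
```

Aggregating at a coarser level is then a matter of grouping on a shorter prefix, which is why a single NUTS 3 column is enough to support all three levels.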

The main issues taken into account are enhanced interoperability of geo-location data, a tabular form of first level EuroSciVoc codes and encoding of LegalBasis as Horizon 2020 pillars. 

Data is provided as a set of interoperable .CSV files and as a network encoding the whole dataset (which can be analysed with the igraph package in Python or R). Geographical encoding at NUTS3 level is currently carried out for Italy, while encoding at NUTS1 level is available for all countries. Specifically, the igraph format is suitable for further analysis with centrality measures and community detection.
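As an illustration of the kind of analysis mentioned above, the following sketch builds an undirected network from a small edge list and computes normalised degree centrality using only the Python standard library. The column names `org_a`, `org_b` and `project_id`, and the sample rows, are assumptions made for the example; the actual layout of the network file should be checked before adapting it.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample: one row per pair of organisations sharing a project.
# Column names are assumptions, not the dataset's documented schema.
SAMPLE = """org_a,org_b,project_id
A,B,P1
A,C,P1
B,C,P1
A,D,P2
"""


def load_neighbours(handle):
    """Read an edge list and return a mapping node -> set of neighbours."""
    neighbours = defaultdict(set)
    for row in csv.DictReader(handle):
        a, b = row["org_a"], row["org_b"]
        if a == b:
            continue  # ignore self-loops
        neighbours[a].add(b)
        neighbours[b].add(a)
    return neighbours


def degree_centrality(neighbours):
    """Degree divided by the maximum possible degree (n - 1)."""
    n = len(neighbours)
    if n < 2:
        return {node: 0.0 for node in neighbours}
    return {node: len(adj) / (n - 1) for node, adj in neighbours.items()}


neighbours = load_neighbours(io.StringIO(SAMPLE))
centrality = degree_centrality(neighbours)
for org, score in sorted(centrality.items(), key=lambda kv: -kv[1]):
    print(org, round(score, 2))
```

For the full dataset, the igraph packages for R and Python provide centrality measures and community detection directly, so a hand-written version like this is mainly useful for checking how the edge list is interpreted.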

List of CSV files
filename content

 orgs.csv 

List of organisations (network nodes) including geolocation encoded as NUTS1-2-3

projects.csv 

List of projects (network edges) including ids of organisations, calls (pillars) and topics (EuroSciVoc)

project_call.csv

List of projects associated to calls (a project can be associated to one or more calls)

project_topic_esv.csv 

List of projects associated to topics encoded as first level of EuroSciVoc taxonomy (a project can be associated to one or more topics)

topic_esv.csv  

List of topics encoded as first level of EuroSciVoc taxonomy, including an extended description and full code

project_topic_esv_table.csv

Table of topics encoded as first level of EuroSciVoc taxonomy. topics are attributed

participation.csv

List of organisations and projects, highlighting ecContribution, totalCost, and role of coordinator.

project_codes_call_2.csv

Extended descriptions of calls

network.csv

Network (nodes = organisations, edges = projects) in "long" dataframe format, suitable for analysis using the igraph package in R and Python.
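The files listed above are meant to be combined. As a rough sketch of one such use, the example below summarises participation rows per organisation: total EC contribution, number of projects, and number of projects coordinated. Only `ecContribution` and `totalCost` are named in the description; the columns `projectID`, `organisationID` and `role`, the role values, and the sample rows are assumptions made for illustration.

```python
import csv
import io
from collections import defaultdict

# Hypothetical rows in the spirit of participation.csv.
# Identifier and role columns are assumed, not documented.
SAMPLE = """projectID,organisationID,role,ecContribution,totalCost
P1,A,coordinator,100000.0,120000.0
P1,B,participant,50000.0,60000.0
P2,A,participant,30000.0,30000.0
P2,C,coordinator,70000.0,90000.0
"""


def summarise(handle):
    """Aggregate participation rows into one summary per organisation."""
    totals = defaultdict(
        lambda: {"ecContribution": 0.0, "projects": 0, "coordinated": 0}
    )
    for row in csv.DictReader(handle):
        entry = totals[row["organisationID"]]
        # Treat an empty contribution field as zero.
        entry["ecContribution"] += float(row["ecContribution"] or 0)
        entry["projects"] += 1
        if row["role"] == "coordinator":
            entry["coordinated"] += 1
    return dict(totals)


summary = summarise(io.StringIO(SAMPLE))
for org, stats in sorted(summary.items()):
    print(org, stats)
```

The resulting per-organisation figures could then be attached to the network nodes as attributes, for example to weight or filter organisations before computing network measures.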

 

Notes

The full scripts and development versions of this project are available online at https://gitlab.com/fabio-morea-areasciencepark/horizon-projects-network

Files

network.csv

Files (19.0 MB)

Name Size
md5:091c014790b222f91f1f4aa743adc978 1.9 MB
md5:5578a180a9166760aae0ee77b828457e 959.5 kB
md5:2dcca6ddc1c8f873f8433cf203e47fb8 3.0 MB
md5:b7118c099c7cbd45421681fe612c350c 1.4 MB
md5:826dc5ec7bb219b60a2af8a878c2a120 1.3 MB
md5:4f18e9b80f0014568954ff79ba3c7249 3.8 kB
md5:fdfd258bb4b74c8a55333934c3c883e8 9.1 MB
md5:18619f26bbc99d8a6df422e7710854d4 1.2 MB
md5:353e41d00a3cb90baba25dac843f50d5 93.0 kB

Additional details