Eurovision Song Contest Dataset

A freely available dataset containing audio features, metadata, contest rankings and voting data of 1562 songs that have competed in the Eurovision Song Contest.

Every year, the dataset is updated with the contest’s results. This release contains the audio features, contestant metadata and voting data of 1562 entries that participated in the Eurovision Song Contest from its first occurrence in 1956 until now.

The metadata and voting data are provided by the EurovisionWorld fansite.

Using the dataset in your research paper?

Please contact janne [dot] spijkervet [at] gmail [dot] com





How to get started

To get an initial idea of the dataset, see the example Jupyter Notebook in the examples directory. It can be opened with the jupyter notebook command.

Data description

The competition ranking is provided for both finals and semi-finals. The country-to-country voting data contains 47007 voting records and is separated into jury voting and televoting where applicable.

contestants.csv

column              description
year                contest year
to_country_id       country id of contestant
to_country          country name of contestant
performer           artist
song                title of the contestant's song
sf_num              semi-final the entry participated in: 1, 2 or 0 (from 2004 to 2007 there was only one semi-final)
running_final       order in the broadcast of the contest's final
running_sf          order in the broadcast of the contest's semi-final
place_final         place in the final
points_final        points in the final
place_sf            place in the semi-final
points_sf           points in the semi-final
points_tele_final   televoting points in the contest's final
points_jury_final   jury voting points in the contest's final
points_tele_sf      televoting points in the contest's semi-final
points_jury_sf      jury voting points in the contest's semi-final
lyrics              lyrics of the song
youtube_url         URL of the video on YouTube
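
As a rough illustration of how these columns might be used, the sketch below reads contestant rows with Python's standard csv module and collects the entries placed first in each final. It assumes the file is comma-separated with a header row matching the column names above, and that entries which did not reach the final have an empty place_final; neither assumption is confirmed by this README. The helper names (load_rows, winners_by_year) and the sample values are made up for the example.

```python
import csv
import io


def load_rows(fileobj):
    """Read a CSV file object into a list of dicts keyed by column name."""
    return list(csv.DictReader(fileobj))


def winners_by_year(rows):
    """Map each contest year to the list of entries placed first in the final."""
    winners = {}
    for row in rows:
        place = (row.get("place_final") or "").strip()
        # Assumption: entries that did not reach the final have an empty place_final.
        if not place:
            continue
        # Accept both "1" and "1.0", since numeric columns may be exported as floats.
        if int(float(place)) == 1:
            entry = (row["to_country"], row["performer"], row["song"])
            # A list per year keeps tied winners instead of overwriting them.
            winners.setdefault(int(row["year"]), []).append(entry)
    return winners


# Tiny in-memory sample with made-up values, standing in for contestants.csv.
SAMPLE = """year,to_country,performer,song,place_final
2001,Country A,Artist A,Song A,1
2001,Country B,Artist B,Song B,2
2002,Country B,Artist C,Song C,1.0
2002,Country C,Artist D,Song D,
"""

rows = load_rows(io.StringIO(SAMPLE))
for year, entries in sorted(winners_by_year(rows).items()):
    print(year, entries)

# With the real file, the rows could be loaded like this instead:
# with open("contestants.csv", newline="", encoding="utf-8") as f:
#     rows = load_rows(f)
```

Winners are stored as a list per year so that a tied first place does not silently drop an entry.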

votes.csv

column              description
year                contest year
round               contest round: final or semi-final
from_country_id     country id of the country giving points
to_country_id       country id of the country receiving points
from_country        country name of the country giving points
to_country          country name of the country receiving points
points              number of points given
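
The voting rows can be aggregated in a similar way. The sketch below sums the points each country received in one round of one contest year. It assumes a comma-separated file with the header names listed above and assumes that the round column contains the literal string "final" for the final; the actual round labels in votes.csv may differ. The function name points_received and the sample values are illustrative only.

```python
import csv
import io
from collections import Counter


def points_received(rows, year, round_name="final"):
    """Sum the points each country received in the given year and round."""
    totals = Counter()
    for row in rows:
        # Assumption: the round column holds exactly the label passed as round_name.
        if int(row["year"]) == year and row["round"] == round_name:
            totals[row["to_country"]] += int(float(row["points"]))
    return totals


# Tiny in-memory sample with made-up values, standing in for votes.csv.
SAMPLE_VOTES = """year,round,from_country_id,to_country_id,from_country,to_country,points
2001,final,aa,bb,Country A,Country B,12
2001,final,cc,bb,Country C,Country B,10
2001,final,bb,aa,Country B,Country A,8
2001,semi-final,aa,cc,Country A,Country C,12
2002,final,aa,bb,Country A,Country B,7
"""

vote_rows = list(csv.DictReader(io.StringIO(SAMPLE_VOTES)))
totals = points_received(vote_rows, 2001)
# List countries from most to fewest points received.
for country, points in totals.most_common():
    print(country, points)
```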

Replication

To replicate the dataset, a WebDriver for Chrome, Firefox, or Safari is required (e.g. ChromeDriver for Chrome), along with the Selenium Python package (selenium). Set up the WebDriver by following its installation instructions. The project's dependencies can be installed using:

pip3 install -r requirements.txt

Use the following command to extract the data of all Eurovision Song Contests between 1956 and 2019:

python3 main.py --start 1956 --end 2019

This creates two files: contestants.csv and votes.csv.
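
After a replication run, one quick sanity check is to compare the header of each generated file against the columns documented in the data description. The sketch below is one possible way to do that with the standard csv module; it is not part of the project's code. It assumes the files are comma-separated with a single header row, and missing_columns is a hypothetical helper name.

```python
import csv
import io

# Column names as listed in the data description of this README.
CONTESTANTS_COLUMNS = [
    "year", "to_country_id", "to_country", "performer", "song", "sf_num",
    "running_final", "running_sf", "place_final", "points_final",
    "place_sf", "points_sf", "points_tele_final", "points_jury_final",
    "points_tele_sf", "points_jury_sf", "lyrics", "youtube_url",
]
VOTES_COLUMNS = [
    "year", "round", "from_country_id", "to_country_id",
    "from_country", "to_country", "points",
]


def missing_columns(fileobj, expected):
    """Return the expected column names that are absent from the CSV header row."""
    # An empty file has no header, so every expected column is reported missing.
    header = next(csv.reader(fileobj), [])
    present = {name.strip() for name in header}
    return [name for name in expected if name not in present]


# Demonstration on an in-memory header that lacks most of the votes columns.
print(missing_columns(io.StringIO("year,round,points\n"), VOTES_COLUMNS))

# With the generated files, the check could be run like this instead:
# with open("votes.csv", newline="", encoding="utf-8") as f:
#     print(missing_columns(f, VOTES_COLUMNS))
```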