There is a newer version of the record available.

Published March 14, 2023 | Version v2
Dataset Restricted

DeepCube: Post-processing dataset of social media data

  • 1. INFALIA

Description

This dataset contains the post-processing of the social media data collected for two different use cases during the first two years of the Deepcube project.

More specifically, it contains two sub-datasets, including:

  1. The UC2 dataset containing the post-processing of the Twitter data collected for the DeepCube use case (UC2) dealing with the climate induced migration in Africa. This dataset contains in total 5,695,253 social media posts collected from the Twitter platform, based on the initial version of search criteria relevant to UC2 - defined by Universitat De Valencia, focused on the regions of Ethiopia and Somalia and started from 26 June, 2021 till March, 2023.
  2. The UC5 dataset containing the post-processing of the Twitter and Instagram data collected for the DeepCube use case (UC5) related to the sustainable and environmentally-friendly tourism. This dataset contains in total 58,143 social media posts collected from the Twitter and Instagram platform (12,881 collected from Twitter and 45,262 collected from Instagram), based on the initial version of search criteria relevant to UC5- defined by MURMURATION SAS, focused on the regions of Brasil and started from 26 June, 2021 till March, 2023.

Additionally, an anottated dataset was created by Twitter historical data for UC2 the year 2010-20220.

  1. The UC2 historical anottated dataset containg the post-processing of the Twitter data collected for the DeepCube use case (UC2) dealing with the climate induced migration in Africa. This dataset contains in total 1721 annotated (412 relevant and 1309 irrelevant)  by social media posts collected from the Twitter platform , focused on the region Somalia.

INFALIA, being a spin-off of the CERTH institute (link) and a partner of a research EU project, releases this dataset containing an unlimited number of Tweet IDs for the sole purpose of enabling the validation of the research conducted within the DeepCube. Moreover, Twitter Content provided to in this dataset to third parties remains subject to the Twitter Policy, and those third parties must agree to the Twitter Terms of Service, Privacy Policy, Developer Agreement, and Developer Policy (link - https://developer.twitter.com/en/developer-terms) before receiving this download.

Notes

testtesttesttesttesttesttesttesttesttesttesttesttesttesttesttesttesttesttesttesttesttesttesttesttesttesttesttesttesttesttesttest

Files

Restricted

The record is publicly accessible, but files are restricted to users with access.

Request access

If you would like to request access to these files, please fill out the form below.

You need to satisfy these conditions in order for this request to be accepted:

To access the dataset, please send a request to Eleni Kamateri at ekamater@infalia.com.

You are currently not logged in. Do you have an account? Log in here

Additional details

Funding

European Commission
DeepCube - EXPLAINABLE AI PIPELINES FOR BIG COPERNICUS DATA 101004188