Published November 28, 2022 | Version v1
Dataset Open

Datasets containing the results from the analysis on SDGs and eHealth inside the Citizen Science Community on Twitter

  • 1. URJC

Description

These datasets contain the results of our analyses of the Citizen Science Community on Twitter. The analyses were carried out to better understand the discussion about SDGs, eLearning and eHealth.

The purpose of sharing these datasets is to provide a basis for reproducing the results reported in the associated deliverable. These files are not raw data: due to privacy concerns, we cannot share personal information from Twitter.

dominant_topics_anonym.xlsx: Excel datasheet. This dataset contains the distribution of the most discussed topics inside the SDGs discussion.

Edges_Hashtag_connected.csv: CSV file. This dataset contains the edges of the network of connected hashtags. These edges can be used to build the network and explore its connections, or to analyse the results statistically.
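As a minimal sketch of how an edge list of this kind could be turned into a network, the snippet below builds an undirected adjacency map and counts how many distinct hashtags each hashtag is connected to. The column names "Source" and "Target" and the sample rows are assumptions made for illustration; the real header of Edges_Hashtag_connected.csv should be checked before use.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample standing in for Edges_Hashtag_connected.csv.
# The column names "Source" and "Target" are assumptions, not taken from the file.
SAMPLE = """Source,Target
citizenscience,sdgs
citizenscience,ehealth
sdgs,climate
"""


def build_hashtag_network(handle, source_col="Source", target_col="Target"):
    """Return an undirected adjacency map {hashtag: set of neighbouring hashtags}."""
    adjacency = defaultdict(set)
    for row in csv.DictReader(handle):
        a = row[source_col].strip().lower()
        b = row[target_col].strip().lower()
        # Skip empty cells and self-loops.
        if a and b and a != b:
            adjacency[a].add(b)
            adjacency[b].add(a)
    return adjacency


network = build_hashtag_network(io.StringIO(SAMPLE))

# Degree = number of distinct hashtags that co-occur with a given hashtag.
degree = {tag: len(neighbours) for tag, neighbours in network.items()}
for tag, value in sorted(degree.items(), key=lambda kv: (-kv[1], kv[0])):
    print(tag, value)
```

To run this on the real file, the `io.StringIO(SAMPLE)` argument would be replaced by an open file handle, for example `open("Edges_Hashtag_connected.csv", newline="", encoding="utf-8")`, with the column names adjusted to match.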

hashtags.csv: CSV file. This dataset contains the results of the most used hashtags in the analysis about eLearning. 
 

hashtags_treemap_health.xlsx: Excel datasheet. This dataset contains the results of the most frequent hashtags in the eHealth analysis.

ldavis_prepared_ieee17.html: HTML file. This file contains the intertopic distance map and the most salient terms from the topic modelling analysis carried out in the SDGs conversation study.

Most_retweeted_accounts.xlsx: Excel datasheet. This dataset contains the top 20 users who received the most retweets in the conversation around eHealth. The Indegree column is the topological value calculated from the retweet network and is equivalent to the number of retweets received. Conversely, the Outdegree column is the number of retweets given to others.

Most_retweeting_account.xlsx: Excel datasheet. This dataset is the counterpart of the previous one: it lists the accounts that retweeted the most in the eHealth analysis. The columns contain the same indicators, Indegree and Outdegree.
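The relationship between retweets and the Indegree and Outdegree columns can be illustrated with a small sketch. It assumes each retweet is one directed edge from the retweeting account to the retweeted account; the account names and edges below are invented for illustration and do not come from the datasets.

```python
from collections import Counter

# Hypothetical retweet edges: (retweeting account, retweeted account).
retweets = [
    ("user_a", "user_c"),
    ("user_b", "user_c"),
    ("user_c", "user_d"),
    ("user_a", "user_d"),
    ("user_a", "user_c"),
]


def degree_table(edges):
    """Return {account: (indegree, outdegree)} for a list of retweet edges."""
    indegree = Counter(target for _, target in edges)   # retweets received
    outdegree = Counter(source for source, _ in edges)  # retweets given
    accounts = set(indegree) | set(outdegree)
    return {account: (indegree[account], outdegree[account]) for account in accounts}


table = degree_table(retweets)

# Ranking by indegree gives the most retweeted accounts,
# ranking by outdegree gives the accounts that retweet the most.
most_retweeted = max(table, key=lambda account: table[account][0])
most_retweeting = max(table, key=lambda account: table[account][1])
print(most_retweeted, table[most_retweeted])
print(most_retweeting, table[most_retweeting])
```

Sorting the same table by the first or the second value is what separates a "most retweeted" ranking from a "most retweeting" one.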

sdgs_count_publish.csv: CSV file. This dataset contains the number of tweets assigned to the different SDGs from the analysis done on the conversation about these Goals.

sdgs_tweets_sdgsaccess.xlsx: Excel datasheet. The same data as the previous file, in a different format for easier handling in Excel.

top_hash_health.xlsx: Excel datasheet. The most used hashtags inside the conversation about eHealth.

topics_tweets_sdgsaccess.xlsx: Excel datasheet. Tweets by topic extracted using Machine Learning in the SDGs analysis.

 

This repository will be updated in the future to include all the available and publishable data from the analyses described above.

Files


Files (1.0 MB)

Checksum Size
md5:da489713098d04dade79617b9fbf6c56 710.3 kB
md5:a43bc6c70e972d56f9b305c562f71c9b 34.0 kB
md5:0a6ad79270caa8fcf9dca451157b489f 288 Bytes
md5:ac9fe79cd38b142d6b0ccc570d5d981a 9.9 kB
md5:826ff4fa003046ecdcb833ebe5579747 203.5 kB
md5:41d2d904a33de57c5db8add4df64726e 19.5 kB
md5:8dbdc6b92b5f0703f1f0bf0cd3775448 18.8 kB
md5:366c23d4f6b123193b74c70ae3d691c8 191 Bytes
md5:ca3ac5cce723bb122f6c757c11b589c7 9.0 kB
md5:407d1eaeadd15e80182cd7dd56889268 9.0 kB
md5:23bc2b4d3dd9f58b425cd1668a29b086 9.2 kB
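
The md5 values listed above can be used to check that a downloaded file arrived intact. The following is a minimal sketch using Python's standard hashlib module; the helper names `md5_of_file` and `matches_listing` are hypothetical, and the listing does not state which checksum belongs to which file, so the pairing has to be taken from the record page itself.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed_checksum):
    """Compare a file against a checksum written as 'md5:<hex digest>'."""
    expected = listed_checksum.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

A call such as `matches_listing("downloaded_file.csv", "md5:<value from the listing>")` returns True only when the local copy is byte-for-byte identical to the published one.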

Additional details

Funding

European Commission
CS-Track - Expanding our knowledge on Citizen Science through analytics and analysis 872522