Datasets containing the results from the analysis of SDGs and eHealth inside the Citizen Science Community on Twitter
Description
These datasets contain the results from our analyses of the Citizen Science Community on Twitter. The analyses were carried out to better understand the discussion about SDGs, eLearning and eHealth.
The purpose of sharing these datasets is to provide the basis for reproducing the results reported in the associated deliverable. These files are not raw data: due to privacy concerns, we cannot share personal information from Twitter.
dominant_topics_anonym.xlsx: Excel datasheet. This dataset contains the distribution of the most discussed topics inside the SDGs discussion.
Edges_Hashtag_connected.csv: CSV file. This dataset contains the edges of the network of connected hashtags. These edges can be used to build the network and explore the connections, or to analyse the results statistically.
hashtags.csv: CSV file. This dataset contains the results of the most used hashtags in the analysis about eLearning.
hashtags_treemap_health.xlsx: Excel datasheet. This dataset contains the results of the most frequent hashtags in the eHealth analysis.
ldavis_prepared_ieee17.html: HTML file. This file contains the Intertopic distance map and most salient terms from the topic modelling analysis done in the SDGs conversation study.
Most_retweeted_accounts.xlsx: Excel datasheet. This dataset contains the 20 users who received the most retweets in the conversation around eHealth. The column called Indegree is the topological value calculated from the network of retweets and is equivalent to the number of retweets received. Outdegree is the opposite: the number of retweets given to others.
Most_retweeting_account.xlsx: Excel datasheet. This dataset is the counterpart of the previous one: the accounts that retweeted the most in the eHealth analysis. The columns contain the same indicators, Indegree and Outdegree.
sdgs_count_publish.csv: CSV file. This dataset contains the number of tweets assigned to the different SDGs from the analysis done on the conversation about these Goals.
sdgs_tweets_sdgsaccess.xlsx: Excel datasheet. The same data as the previous file, in a different format for easier handling in Excel.
top_hash_health.xlsx: Excel datasheet. The most used hashtags inside the conversation about eHealth.
topics_tweets_sdgsaccess.xlsx: Excel datasheet. Tweets by topic extracted using Machine Learning in the SDGs analysis.
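The Indegree and Outdegree indicators described above can be recomputed from any directed edge list. The following is a minimal sketch, not the code used for the deliverable. The column names `Source` and `Target` and the toy account names are assumptions made for illustration; the actual headers of the shared files may differ and should be checked before use.

```python
import csv
import io
from collections import Counter


def read_edges(handle, source_col="Source", target_col="Target"):
    """Return (source, target) pairs from a CSV edge list.

    The column names are assumptions; adjust them to the real file headers.
    """
    reader = csv.DictReader(handle)
    return [(row[source_col], row[target_col]) for row in reader]


def degree_counts(edges):
    """Count indegree and outdegree for a directed edge list.

    An edge (a, b) is read here as "a retweets b", so the indegree of b
    is the number of retweets it receives and the outdegree of a is the
    number of retweets it gives.
    """
    indegree = Counter()
    outdegree = Counter()
    for source, target in edges:
        outdegree[source] += 1
        indegree[target] += 1
    return indegree, outdegree


# Toy data standing in for a real edge file (hypothetical accounts).
sample = io.StringIO(
    "Source,Target\n"
    "user_a,user_b\n"
    "user_c,user_b\n"
    "user_b,user_a\n"
)
edges = read_edges(sample)
indegree, outdegree = degree_counts(edges)
print(indegree.most_common(1))  # [('user_b', 2)]
```

To apply the same reader to a file on disk, it could be called as `read_edges(open("Edges_Hashtag_connected.csv", newline=""))`, again assuming the column names match. Note that a hashtag network may be undirected, in which case only the combined degree is meaningful.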
This repository will be updated in the future to include all the available and publishable data from the different analyses described above.
Files
Total size: 1.0 MB

| Checksum | Size |
|---|---|
| md5:da489713098d04dade79617b9fbf6c56 | 710.3 kB |
| md5:a43bc6c70e972d56f9b305c562f71c9b | 34.0 kB |
| md5:0a6ad79270caa8fcf9dca451157b489f | 288 Bytes |
| md5:ac9fe79cd38b142d6b0ccc570d5d981a | 9.9 kB |
| md5:826ff4fa003046ecdcb833ebe5579747 | 203.5 kB |
| md5:41d2d904a33de57c5db8add4df64726e | 19.5 kB |
| md5:8dbdc6b92b5f0703f1f0bf0cd3775448 | 18.8 kB |
| md5:366c23d4f6b123193b74c70ae3d691c8 | 191 Bytes |
| md5:ca3ac5cce723bb122f6c757c11b589c7 | 9.0 kB |
| md5:407d1eaeadd15e80182cd7dd56889268 | 9.0 kB |
| md5:23bc2b4d3dd9f58b425cd1668a29b086 | 9.2 kB |
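The MD5 checksums listed for the files can be used to confirm that a download is intact. The sketch below uses only Python's standard `hashlib` module; the function names `md5_of_file` and `matches_checksum` are illustrative and not part of the repository.

```python
import hashlib


def md5_of_file(path, chunk_size=65536):
    """Return the hexadecimal MD5 digest of the file at `path`.

    The file is read in chunks so that large files do not need to fit
    in memory.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file against a checksum written as 'md5:<hex>' or '<hex>'."""
    expected = expected.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

A downloaded file would be checked by passing its local path and the checksum string shown for that file on the record page; a `False` result suggests a corrupted or incomplete download.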