Published May 16, 2024 | Version v1
Dataset Open

Mental Health in Social Networks with Machine Learning Algorihtms

  • 1. Universidad de valladolid
  • 2. Universidad de Valladolid

Contributors

Contact person:

Data collector:

  • 1. Universidad de Valladolid

Description

The DatasetMH.xlsx excel corresponds to a corpus of mental health in social networks labelled with polarity and stigma. In particular, the corpus consists of 2,287 comments labelled with polarity (positive, negative, neutral) and stigma from comments on Instagram posts about celebrity mental health disclosures:

  1. Polarity: It consists of giving a positive, negative or neutral/undefined value to the comments in response to the disclosure or description of the symptomatology in the post. Positive polarity reflects understanding, encouragement or even admiration of the publication. E.g., “Cheer up, we love you".”. Negative polarity is assigned when the person expresses negative opinions, usually questioning the post with ironic, sarcastic or even mocking and disparaging comments. E.g., “how you show that you don't know what depression or anxiety is, shame on you!”. Neutral or undefined polarity is assigned in cases where no clear opinion is detected or can be interpreted in both directions. E.g., “take medication, it will help you" "and your partner?”
  2. Stigma: stigmatising responses to comments are behaviours in which negative beliefs and emotions towards MH problems are expressed. Stigma manifests in a variety of forms including rejection and anger against the person, which may extend to contempt or mockery, belittling their problem. E.g.,"What a desire to draw attention to yourself"; "what you have is a story"; "you're so inconsistent and seeking the limelight". Because socially we know that "stigma is wrong" many rejection comments are made in an ironic or sarcastic way. E.g., and how do you write on insta?"; "better information from someone who doesn't have a current account". Additionally, anger is shown by arguing that such posts "trivialise or commercialise" MH. E.g.,"don't come and tell me your false stories of overcoming, without even knowing what it is to work...". Other times the stigma manifests itself as pity or sorrow for the person. E.g.,“It breaks my heart”; “poor thing”.

The file DatasetMH_Emotions.xlsx corresponds to a corpus of mental health in social networks labelled with emotions. In particular, the corpus consists of 2,287 comments labelled with five emotions plus a neutral class from comments on Instagram posts about celebrity mental health disclosures. These emotions are:

  • Love/admiration: This emotion involves messages where admiration, approval and love are closely related.
  • Gratitude: the messages imply a sincere appreciation for sharing mental health-related content on social networks.
  • Comprehension/empathy/identification: The messages involve interest in and understanding of the message, including self-identification with the situation or context.
  • Sadness: This primary emotion is produced by events that are not pleasant and that denote heaviness. It includes many manifestations of pity for the person.
  • Anger/contempt/mockery: This emotion involves responses of irritation and attacks on the person as ridiculous and superficial.
  • Neutral: This category corresponds to messages without emotions.

The labelling process of both datasets was divided into two phases: an initial phase with a pilot corpus (N = 787 comments) and a second phase focused on the development of the corpus with all the comments of the selected posts (N = 21151). The same methodology was followed in both phases: once the comments were collected, the corpus was cleaned, and then two independent experts were responsible for labelling each category. A third expert then reviewed the comments to resolve discrepancies. In the third and final phase, a final corpus for application to the machine learning algorithms is built from the large corpus (N = 2287).

Classification models are a set of machine learning algorithms developed to assess emotional response, i.e. polarity, stigma and emotions in social networks, based on previously developed datasets (DatasetMH_Emotions.xlsx, DatasetMH.xlsx).

Files

BERT_Emotions.ipynb

Files (3.2 MB)

Name Size Download all
md5:af6a83eeb4899e5af5ca90cb60a5d493
2.7 MB Preview Download
md5:c05bc00f5d5eb77e3df95a1195d9a930
168.7 kB Download
md5:f0a00339ec630795b6a572784253f51c
161.3 kB Download
md5:b7f027455491710620813f9ecda9aff7
17.6 kB Download
md5:5af9d4d49bf063bcf13f90438657fd82
19.2 kB Download
md5:a19367e7cf7ee611da6db36de5297fb4
46.4 kB Preview Download
md5:aced76fe37fc2e5c7801baa65221d7f6
4.1 kB Preview Download
md5:cfc9876ddbea8a7da364304530fdbf29
40.5 kB Preview Download

Additional details

Dates

Available
2024

Software

Repository URL
https://github.com/GCOdeveloper/Mental-Health-Dataset
Programming language
Python
Development Status
Active