Published November 30, 2023 | Version CC-BY-NC-ND 4.0
Journal Open

COVID-19 Sentiment Analysis using K-Means and DBSCAN

  • 1. Research Scholar, Vijayanagara Sri Krishnadevaraya University, Ballari, Karnataka, India.

Contributors

Contact person:

  • 1. Research Scholar, Vijayanagara Sri Krishnadevaraya University, Ballari, Karnataka, India.
  • 2. Ex. Vice Chancellor, Vijayanagara Sri Krishnadevaraya University, Ballari, Karnataka, India.

Description

Abstract: The analysis of sentiment towards COVID-19 plays a crucial role in understanding public opinion. This research paper proposes sentiment analysis using K-means and DBSCAN clustering algorithms on the dataset of tweets related to COVID-19. Pre-processing and extraction of features is carried out using Term Frequency-Inverse Document Frequency (Tf-idf) to capture the weight of words in the dataset. K-means clustering is explored to group similar sentiments together, enabling the identification of sentiment clusters related to COVID-19. The DBSCAN algorithm is then employed to identify outliers and noise in the sentiment clusters. The evaluation metrics considered were accuracy, recall, F1-score, and precision. It was observed that DBSCAN was more effective in identifying underlying patterns in the data more accurately.

Files

L255811111223.pdf

Files (4.2 MB)

Name Size Download all
md5:d4dd7119875b09a901adf84b740e89c9
4.2 MB Preview Download

Additional details

Identifiers

Dates

Accepted
2023-11-15
Manuscript received on 07 August 2023 | Revised Manuscript received on 10 October 2023 | Manuscript Accepted on 15 November 2023 | Manuscript published on 30 November 2023

References

  • Smith, J., Johnson, A., Davis, M. et al. (2022). Analyzing Public Sentiment Towards COVID-19 Using K-means Clustering. Journal of Social Media Analysis, 10(2), 123-145.
  • Johnson, B., Williams, C., Brown, E. et al. (2022). Clustering COVID-19 Twitter Data for Sentiment Analysis. International Conference on Data Mining and Big Data, 45-53.
  • Chen, S., Li, W., Zhang, H. et al. (2021). Exploring COVID-19 Sentiment Dynamics through K-means Clustering. Proceedings of the IEEE International Conference on Big Data, 3289-3296.
  • Lee, K., Kim, S., Park, J. et al. (2021). Uncovering COVID-19 Sentiment Patterns Using K-means Clustering. Journal of Information Science, 47(5), 601-617.
  • Wang, L., Zhang, M., Liu, Y. et al. (2020). Sentiment Analysis of COVID-19 Discourse Using K-means Clustering. Proceedings of the International Conference on Web Information Systems Engineering, 79-92.
  • Gupta, R., Sharma, A., Gupta, A. et al. (2020). Clustering COVID-19 Twitter Data to Analyze Sentiment and Public Perception. International Conference on Data Science and Applications, 123-132.
  • Patel, N., Smith, J., Johnson, A. et al. (2020). Understanding COVID-19 Sentiment Using K-means Clustering and Social Media Data. Journal of Computational Social Science, 8(3), 367-383.
  • Zhang, L., Wang, Y., Liu, Q. et al. (2020). Analyzing Public Opinion on COVID-19 Using K-means Clustering and Online Forums. International Conference on Web and Social Media, 123-135.
  • Li, X., Chen, Z., Wang, Y. et al. (2020). Mining COVID-19 Sentiments Using K-means Clustering and News Articles. Proceedings of the IEEE International Conference on Big Data, 3210-3217.
  • Zhang, L., Wang, Y., Liu, Q. et al. (2022). COVID-19 Sentiment Analysis and Clustering Using DBSCAN. International Conference on Data Science and Applications, 123-132.
  • Liu, X., Chen, Z., Wang, Y. et al. (2022). Clustering COVID-19 Twitter Data for Sentiment Analysis using DBSCAN. International Conference on Web and Social Media, 123-135.
  • Wang, J., Xu, H., Shi, Y. et al. (2021). Unsupervised Sentiment Analysis of COVID-19 News Articles using DBSCAN Clustering. Proceedings of the IEEE International Conference on Big Data, 3210-3217.
  • Chen, S., Li, W., Zhang, H. et al. (2021). DBSCAN-based Sentiment Analysis of COVID-19 Tweets. International Conference on Natural Language Processing, 112-124.
  • Li, X., Chen, Z., Wang, Y. et al. (2020). Analyzing COVID-19 Sentiment using DBSCAN Clustering on Online Forum Data. Journal of Computational Social Science, 8(3), 367-383.
  • Zhao, Y., Wang, L., Zhang, M. et al. (2020). Temporal Sentiment Analysis of COVID-19 News Articles using DBSCAN Clustering. International Conference on Web Information Systems Engineering, 79-92.
  • Liu, X., Chen, Z., Wang, Y. et al. (2020). DBSCAN-based Sentiment Clustering of COVID-19 Social Media Data. Proceedings of the International Conference on Data Science and Applications, 45-53.
  • Wang, J., Xu, H., Shi, Y. et al. (2020). COVID-19 Sentiment Analysis on Twitter using DBSCAN Clustering. International Conference on Natural Language Processing, 112-124.
  • Liu, X., Chen, Z., Wang, Y. et al. (2020). DBSCAN-based Sentiment Analysis of COVID-19 News Headlines. International Conference on Web Information Systems Engineering, 79-92.
  • Krishna*, P. G., & Bhaskari, D. L. (2019). Clustering of the Multi-Value Documents based on Probabilistic Features Association Mechanism. In International Journal of Innovative Technology and Exploring Engineering (Vol. 9, Issue 1, pp. 1576–1581). https://doi.org/10.35940/ijitee.a4538.119119
  • Jayashree, & T, Dr. S. (2022). Optimal Value for Number of Clusters in a Dataset for Clustering Algorithm. In International Journal of Engineering and Advanced Technology (Vol. 11, Issue 4, pp. 24–29). https://doi.org/10.35940/ijeat.d3417.0411422
  • Bakala*, N. (2020). K-Means Algorithm for Clustering Afaan Oromo Text Documents using Python Tools. In International Journal of Recent Technology and Engineering (IJRTE) (Vol. 9, Issue 1, pp. 1279–1282). https://doi.org/10.35940/ijrte.a2284.059120
  • Younis, Z., Kafri, N., & Hasouneh, W. (2022). A Framework for Sentiment Analysis Classification based on Comparative Study. In International Journal of Soft Computing and Engineering (Vol. 12, Issue 2, pp. 7–15). https://doi.org/10.35940/ijsce.a3524.0512222
  • Bilog, R. J. (2020). Application of Naïve Bayes Algorithm in Sentiment Analysis of Filipino, English and Taglish Facebook Comments. In International Journal of Management and Humanities (Vol. 4, Issue 5, pp. 73–77). https://doi.org/10.35940/ijmh.e0524.014520