Published May 30, 2022 | Version CC BY-NC-ND 4.0
Journal article Open

A Framework for Sentiment Analysis Classification based on Comparative Study

  • 1. Department of Computer Science, Al-Quds University, Al-Quds, Palestine.

Contributors

Contact person:

  • 1. Department of Computer Science, Al-Quds University, Al-Quds, Palestine.

Description

Abstract: A number of Feature Selection and Ensemble Methods for Sentiment Analysis Classification had been introduced in many searches. This paper presents A frame work for sentiment analysis classification based on comparative study on different classification algorithms i.e., comparison between combinations of classification algorithms: Bayes, SVM, Decision Tree. We also examined the effect of using feature selection methods (statistical, wrapper, or embedded), ensemble methods (Bagging, Boosting, Stacking, or Vote), tuning parameters of methods (SVMAttributeEval, Stacking), and the effect of merging feature subsets selected by embedded method on the classification accuracy. Particularly, the results showed that accuracy depends on the feature selection method, ensemble methods, number of selected features, type of classifier, and tuning parameters of the algorithms used. A high accuracy of up to 99.85% was achieved by merging features of two embedded methods when using stacking ensemble method. Also, a high accuracy of 99.5% was achieved by tuning parameters in stacking method, and it reached 99.95% and 100% by tuning parameters in SVMAttributeEval method using statistical and machine learning approaches, respectively. Furthermore, tuning algorithms' parameters reduced the time needed to select feature subsets. Thus, these combinations of algorithms can be followed as a frame work for sentiment analysis.

Notes

Published By: Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP) © Copyright: All rights reserved.

Files

A35240911121.pdf

Files (1.6 MB)

Name Size Download all
md5:b5bfc3f8bfa22b733db486b76be5072b
1.6 MB Preview Download

Additional details

Related works

Is cited by
Journal article: 2231-2307 (ISSN)

References

  • Zin H., Mustapha.N, Murad M., and Sharef N., "The effects of pre-processing strategies in sentiment analysis of online movie reviews," AIP Conference Proceedings 1891, 020089, 2017.
  • Isalm M. and Sultana N., "Comparative Study on Machine Learning Algorithms for Sentiment Classification," International Journal of Computer Applications, vol. 182, no. 21, pp. 1–7, 2018.
  • Kumbhar P. and Mali M. "A Survey on Feature Selection Techniques and Classification Algorithms for Efficient Text Classification," International Journal of Science and Research (IJSR), vol. 5, no. 5, pp. 1267–1275, May 2016.
  • Joshi N. and Srivastava S., "Improving Classification Accuracy Using Ensemble Learning Technique (Using Different Decision Trees)," 2014.
  • Pant S. and Jain K., "Sentiment Analysis Using Feature Selection and Classification Algorithms- a Survey," International Journal of Innovative in Engineering Research and Technology [IJIERT] ISSN: 2394-3696 VOLUME 4, ISSUE 5, May 2017.
  • Sahayak V., Shete V., and Pathan A., "Sentiment Analysis on Twitter Data," International Journal of Innovative Research in Advanced Engineering (IJIRAE) Issue 1, Vol. 2, January 2015.
  • Gautam G. and Yadav D., "Sentiment analysis of twitter data using machine learning approaches and semantic analysis," 2014 Seventh International Conference on Contemporary Computing (IC3), 2014.
  • Medhat W., Hassan A., and Korashy H., "Sentiment analysis algorithms and applications: A survey," Ain Shams Engineering Journal, vol. 5, no. 4, pp. 1093–1113, 2014.
  • Binali H., Potdar V., and Wu C., "A state of the art opinion mining and its application domains," 2009 IEEE International Conference on Industrial Technology, 2009.
  • B. Liu and L. Zhang, "A Survey of Opinion Mining and Sentiment Analysis," Mining Text Data, pp. 415–463, 2012.
  • Kotsiantis S., Kanellopoulos D., and Pintelas P., "Data Preprocessing for Supervised Learning," International Journal of Computer Science Volume 1 Number 1 ISSN 1306-4428, 2006.
  • Hirapara Sh., et al., "Survey on Opinion Mining and Feature Selection," International Journal of Innovative Research in Computer and Communication Engineering (An ISO 3297: 2007 Certified Organization), https:// www.ijircce.com Vol. 5, Issue 3, March 2017.
  • Wijayasekara D., Manic M., and Mcqueen M., "Information gain based dimensionality selection for classifying text documents," 2013 IEEE Congress on Evolutionary Computation, 2013.
  • Saif H., et al., "Semantic Sentiment Analysis of Twitter," The Semantic Web – ISWC 2012 Lecture Notes in Computer Science, pp. 508–524, 2012.
  • O'Keefe T. and Koprinska I.," Feature Selection and Weighting Methods in Sentiment Analysis," Proceedings of the 14th Australasian Document Computing Symposium, 2009.
  • Madasu A. and Elango S., "Efficient feature selection techniques for sentiment analysis," Multimedia Tools and Applications, vol. 79, no. 9-10, pp. 6313–6335, 2019.
  • Gnanambal S., et al., "Classification Algorithms with Attribute Selection: An Evaluation Study using WEKA," International Journal of Advanced Networking and Applications, Volume: 09 Issue: 06 Pages: 3640-3644 (2018) ISSN: 0975-0290, April 2018.
  • Witten I., et al., (1999). "Weka: Practical machine learning tools and techniques with Java implementations," Working paper 99/11, Hamilton, New Zealand: University of Waikato, Department of Computer Science, 1999.
  • Rijn J. and Hutter F., "Hyperparameter Importance Across Datasets," Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018.
  • Govindarajan M., "Sentiment Analysis of Movie Reviews using Hybrid Method of Naive Bayes and Genetic Algorithm," International Journal of Computer Research, 2013.
  • Parmar H., et al., "Sentiment Mining of Movie Reviews using Random Forest with Tuned Hyperparameters," 2014.
  • Agarwal B., et al., "Concept-Level Sentiment Analysis with Dependency-Based Semantic Parsing: A Novel Approach," Cognitive Computation, vol. 7, no. 4, pp. 487–499, 2015.
  • Devaraj M., Piryani R., and Singh V., "Lexicon Ensemble and Lexicon Pooling for Sentiment Polarity Detection," IETE Technical Review, vol. 33, no. 3, pp. 332–340, 2015.
  • Agarwal B. and Mittal N., "Prominent feature extraction for review analysis: an empirical study," Journal of Experimental & Theoretical Artificial Intelligence, vol. 28, no. 3, pp. 485–498, 2014.
  • Yousefpour A., Ibrahim R., and A. Hamed H., "Ordinal-based and frequency-based integration of feature selection methods for sentiment analysis," Expert Systems with Applications, vol. 75, pp. 80–93, 2017.

Subjects

ISSN: 2231-2307 (Online)
https://portal.issn.org/resource/ISSN/2231-2307#
Retrieval Number: 100.1/ijsce.A35240911121
https://www.ijsce.org/portfolio-item/a35240911121/
Journal Website: www.ijsce.org
https://www.ijsce.org/
Publisher: Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP)
https://www.blueeyesintelligence.org