Sentiment Analysis for Amazon Product Reviews
Authors/Creators
- 1. Department of Computer Science, Galgotias University, Gautam Buddha Nagar (Uttar Pradesh), India.
- 2. Department of Computer Science, Galgotias University, Gautam Buddha Nagar (Uttar Pradesh), India.
Contributors
Contact person:
- 1. Department of Computer Science, Galgotias University, Gautam Buddha Nagar (Uttar Pradesh), India.
Description
Abstract: Sentiment analysis is a classicfication process whereby machine learning techniques are applied on text-driven datasets in order to analyse the emotion / opinion expressed in a text, e.g. a message being positive or negative about a certain topic. The problem is to conduct a sentiment analysis (positive and negative sentiment) on online product reviews of Products (unlocked mobile phones) sold on Amazon.com. The trained model can be used to predict users’ sentiment based on their online reviews. In this project, different machine learning algorithms are compared, trained and tested on a dataset containing 400000 reviews. The performance of three different algorithms were compared: Multinomial Naive Bayes (MNB), Logistic Regression and Long short-term memory network (LSTM). The Logistic Regression model resulted in the highest performance with Accuracy of 0.95 and AUC of 0.94. The dataset consists of 400 thousand reviews of products (unlocked mobile phones) sold on Amazon.com which is publicly available on Kaggle. Solution to the problem would be useful for a brand to gain a broad sense of user's’ sentiment towards a product through online reviews Further study is needed to investigate if the classfication remains accurate when including more than two classes (e.g. Introducing a neutral class).
Notes
Files
B70990711222.pdf
Files
(703.3 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:2ac7f3c08ebcedca4d48f6bd9612b66b
|
703.3 kB | Preview Download |
Additional details
Related works
- Is cited by
- Journal article: 2277-3878 (ISSN)
References
- Data Source from Kaggle https://www.kaggle.com/PromptCloudHQ/amazon-reviews-unlockedmobile-phon es
- " Working with text Data" from sklearn http://scikitlearn.org/stable/tutorial/text_analytics/working_with_text_data.htm
- "Using pre-trained word embeddings in a Keras model" from Keras Blog https://blog.keras.io/using-pre-trained-word-embeddings-in-akeras-model.html
- "Deep Learning with Word2Vec" from Gensim https://radimrehurek.com/gensim/models/word2vec.html
- "Deep Learning, NLP, and Representations" http://colah.github.io/posts/2014-07-NLP-RNNs-Representations/)
- " An Intuitive Understanding of Word Embeddings: From Count Vectors to Word2Vec" https://www.analyticsvidhya.com/blog/2017/06/word-embeddingscount-word2vee c/
- "Sequence Classification with LSTM Recurrent Neural Networks in Python with Keras" https://machinelearningmastery.com/sequenceclassification-lstm-recurrent-neura l-networks-python-keras/
- "Embedding and Tokenizer in Keras" http://www.orbifold.net/default/2017/01/10/embedding-and-tokenizerin-keras/
Subjects
- ISSN: 2277-3878 (Online)
- https://portal.issn.org/resource/ISSN/2277-3878
- Retrieval Number: 100.1/ijrte.B70990711222
- https://www.ijrte.org/portfolio-item/b70990711222/
- Journal Website: www.ijrte.org
- https://www.ijrte.org/
- Publisher: Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP)
- https://www.blueeyesintelligence.org/