Email Spam Detection Using Natural Language Processing
Authors/Creators
- 1. 1 #1 Jharkhand University of Technology, Ranchi, India,
Description
ABSTRACT
The rapid growth of digital communication has led to a significant increase in spam emails, which pose serious threats to information security and user productivity. Spam messages often contain phishing links, malware attachments, and fraudulent advertisements. Traditional rule-based spam filtering methods are increasingly ineffective in detecting evolving spam patterns. Natural Language Processing (NLP) combined with machine learning techniques offers a powerful solution for automated spam detection. This study proposes a machine learning-based framework for identifying spam emails using NLP techniques. The system applies text preprocessing methods such as tokenization, stop-word removal, and term frequency–inverse document frequency (TF–IDF) feature extraction to transform email text into numerical features. Machine learning algorithms including Naïve Bayes, Logistic Regression, Support Vector Machine (SVM), and Random Forest are implemented for classification. Model performance is evaluated using Accuracy, Precision, Recall, and F1-score metrics. Experimental results demonstrate that machine learning-based models can effectively identify spam emails and significantly improve email filtering accuracy.
Key words: Spam Detection, Natural Language Processing, Machine Learning, Email Classification, Text Mining.
Files
18.pdf
Files
(292.4 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:887a276a28d533591b18eb01da8bfafe
|
292.4 kB | Preview Download |