Email Spam Detection Using Natural Language Processing

Kripa Singh

doi:10.5281/zenodo.19003774

Published March 13, 2026 | Version v1

Journal article | Open access

Kripa Singh (Researcher)¹

1. Jharkhand University of Technology, Ranchi, India

ABSTRACT

The rapid growth of digital communication has led to a significant increase in spam emails, which threaten information security and reduce user productivity. Spam messages often contain phishing links, malware attachments, and fraudulent advertisements. Traditional rule-based spam filters are increasingly ineffective because spam patterns keep evolving. Natural Language Processing (NLP) combined with machine learning offers a way to automate spam detection. This study proposes a machine learning-based framework for identifying spam emails using NLP techniques. The system preprocesses email text with tokenization and stop-word removal, then applies term frequency–inverse document frequency (TF–IDF) feature extraction to transform the text into numerical features. Four machine learning algorithms are implemented for classification: Naïve Bayes, Logistic Regression, Support Vector Machine (SVM), and Random Forest. Model performance is evaluated using Accuracy, Precision, Recall, and F1-score. Experimental results indicate that machine learning-based models can effectively identify spam emails and improve email filtering accuracy.

Keywords: Spam Detection, Natural Language Processing, Machine Learning, Email Classification, Text Mining.
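
The preprocessing, TF–IDF, and evaluation steps named in the abstract can be illustrated with a minimal sketch. This is not the author's implementation: the stop-word list, the helper names (`preprocess`, `tfidf`, `binary_metrics`), the example emails, and the particular TF–IDF variant (relative term frequency times ln(N/df), with no smoothing) are all assumptions chosen for illustration, and the classifiers themselves are left out.

```python
import math
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list (an assumption for this sketch).
STOP_WORDS = {"a", "an", "the", "to", "of", "and", "is", "in", "for", "you", "your", "at"}


def preprocess(text):
    """Lowercase the text, tokenize on runs of letters, and drop stop words."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [t for t in tokens if t not in STOP_WORDS]


def tfidf(docs):
    """Return one {term: weight} dict per document.

    Assumed variant: tf = count / document length, idf = ln(N / df).
    Library implementations often add smoothing and normalization.
    """
    tokenized = [preprocess(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents containing each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1  # guard against empty documents
        vectors.append(
            {t: (c / total) * math.log(n_docs / df[t]) for t, c in counts.items()}
        )
    return vectors


def binary_metrics(y_true, y_pred, positive="spam"):
    """Compute Accuracy, Precision, Recall, and F1 for one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {
        "accuracy": (tp + tn) / len(pairs) if pairs else 0.0,
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


# Made-up example emails, not data from the study.
emails = [
    "Win a free prize now",
    "Meeting agenda for Monday",
    "Free prize waiting, claim now",
]
vectors = tfidf(emails)

# Made-up labels and predictions, only to exercise the metric function.
metrics = binary_metrics(
    ["spam", "ham", "spam", "ham"],
    ["spam", "spam", "ham", "ham"],
)
```

In this sketch a term that appears in only one email (such as "win") receives a higher weight than a term shared across emails (such as "free"), which is the property that makes TF–IDF features useful as input to the classifiers listed in the abstract.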

Files

18.pdf (292.4 kB), md5:887a276a28d533591b18eb01da8bfafe
