HYBRID SPAM FILTERING USING SVM FOR CLASSIFICATION AND FEATURE ENGINEERING

Dr. Chukwuemeka, Godwin Obinna

doi:10.5281/zenodo.18668219

Published January 14, 2026 | Version v1

Publication Open

HYBRID SPAM FILTERING USING SVM FOR CLASSIFICATION AND FEATURE ENGINEERING

Dr. Chukwuemeka, Godwin Obinna¹

1. Department of Computer Science, University of Nigeria, Nsukka, Enugu State, Nigeria

Spam emails pose a persistent and escalating challenge in digital communication, resulting in wasted bandwidth, reduced productivity, and increased cybersecurity threats. Traditional rule-based filtering systems have proven inefficient, particularly against evolving spam tactics such as obfuscation and disguised content. This study investigates the application of Support Vector Machine (SVM), a robust supervised machine learning algorithm, for the classification and feature extraction of spam in email datasets. We explore how SVM leverages high-dimensional feature spaces to separate spam from legitimate (ham) emails with precision. The methodology involves preprocessing email data, extracting key linguistic and structural features, and training an SVM classifier on labeled datasets to distinguish spam from ham. Compared to rule-based and other conventional techniques, the SVM approach requires minimal manual intervention and adapts more efficiently to new spam trends. Empirical results demonstrate high accuracy, precision, and recall rates in spam detection, validating the effectiveness of the model. Furthermore, the study identifies the most relevant features contributing to spam classification, offering insights into how modern spammers manipulate email content. These findings not only enhance spam detection capabilities but also contribute to the development of intelligent and adaptive email filtering systems. The study concludes that SVM remains a valuable tool in the fight against spam, especially when integrated with continuous learning and real-time data streams

Files

2.pdf

Files (648.0 kB)

Name	Size	Download all
2.pdf md5:aeac1b007350e0dd08f16c487a2492ba	648.0 kB	Preview Download

	All versions	This version
Views	9	9
Downloads	2	2
Data volume	1.3 MB	1.3 MB

HYBRID SPAM FILTERING USING SVM FOR CLASSIFICATION AND FEATURE ENGINEERING

Authors/Creators

Description

Files

2.pdf

Files (648.0 kB)