Lightweight Phishing Url Detection Using Rule-Based Feature Extraction in Python
Authors/Creators
- 1. Assistant Professor, Department of Commerce (Computer Application) S. A. College of Arts & Science, Chennai.
Contributors
Editor:
Description
Abstract
Phishing is one of the most common online threats where fake websites trick users into sharing sensitive information like passwords or bank details. In this paper, we introduce a simple yet effective method to detect phishing websites by analyzing the structure of their URLs. Instead of using complex or heavy machine learning models, we use Python to create a lightweight system that checks for specific patterns in the URL—like if it’s unusually long, contains suspicious words (such as "secure" or "login"), uses special characters like '@', or even tries to hide its identity using shortened links or IP addresses. We extract these features and use a basic classifier (like Naive Bayes or Decision Tree) to decide if the link is safe or a potential phishing[1] attempt. Our technique performs well on publicly available datasets and requires very little system resources, making it suitable for small tools like browser extensions or email scanners. Overall, the goal of this project is to provide a fast, transparent, and easy-to-use solution that helps protect users online without needing advanced hardware or deep technical knowledge.
Files
97...pdf
Files
(613.7 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:f1a3736092851ab5793f62be7c3a8e99
|
613.7 kB | Preview Download |