Published June 10, 2021 | Version v1
Dataset Open

Phishing website dataset

  • 1. Eindhoven University of Technology

Description

The dataset comprises phishing and legitimate web pages, which have been used for experiments on early phishing detection.

Detailed information on the dataset and the data collection is available in:

Bram van Dooremaal, Pavlo Burda, Luca Allodi, and Nicola Zannone. 2021. Combining Text and Visual Features to Improve the Identification of Cloned Webpages for Early Phishing Detection. In ARES '21: Proceedings of the 16th International Conference on Availability, Reliability and Security. ACM.

Notes

This work is supported by the ITEA3 programme through the DEFRAUDIfy project funded by Rijksdienst voor Ondernemend Nederland (grant no. ITEA191010).

Files

associated-rawdata.zip

Files (38.2 GB)

Checksum (MD5)                          Size
md5:6c0d089e9937e718f708a8b6276fba76    7.5 GB
md5:9d0540df0c0d4f098f1df957fa09f666    10.0 GB
md5:3952a079a422e97bcced124e6f7d41d9    20.7 GB
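
Because the archives are large, it can be worth checking a download against the MD5 checksums listed above before unpacking it. The following is a minimal sketch of such a check, not part of the dataset itself. The function names `md5_of_file` and `matches_listed_checksum` are illustrative, and since the listing does not show which file name belongs to which checksum, the sketch only tests whether a file's digest matches any of the three listed values.

```python
import hashlib

# MD5 checksums copied from the Files listing. The listing does not show
# which file name belongs to which checksum, so they are kept as a set.
LISTED_MD5 = {
    "6c0d089e9937e718f708a8b6276fba76",  # 7.5 GB file
    "9d0540df0c0d4f098f1df957fa09f666",  # 10.0 GB file
    "3952a079a422e97bcced124e6f7d41d9",  # 20.7 GB file
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks
    so that multi-gigabyte archives are never loaded into memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed=LISTED_MD5):
    """Return True if the file's MD5 digest equals one of the listed checksums."""
    return md5_of_file(path) in listed
```

For example, `matches_listed_checksum("downloads/archive.zip")` would return `True` only for an intact copy of one of the three files; the path here is a placeholder for wherever the download was saved.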