Phishing Email Dataset
@kaggle.naserabdullahalam_phishing_email_dataset
@kaggle.naserabdullahalam_phishing_email_dataset
This dataset was compiled by researchers to study phishing email tactics. It combines emails from a variety of sources to create a comprehensive resource for analysis.
Enron and Ling Datasets: These datasets focus on the core content of phishing emails, containing subject lines, email body text, and labels indicating whether the email is spam (phishing) or legitimate.
CEAS, Nazario, Nigerian Fraud, and SpamAssassin Datasets: These datasets provide broader context for the emails, including sender information, recipient information, date, and labels for spam/legitimate classification.
The final dataset combines the information from the initial datasets into a single resource for analysis. This dataset contains:
This dataset allows researchers to study the content of phishing emails and the context in which they are sent to improve detection methods.
Please cite the following two articles if you are using this dataset:
@kaggle
@owid
Share link
Anyone who has the link will be able to view this.