Spam Email Classification Dataset
Combined Spam Email CSV of 2007 TREC Public Spam Corpus and Enron-Spam Dataset
@kaggle.purusinghvi_email_spam_classification_dataset
This is a CSV file containing 83,446 email records, each labelled as either spam or not spam. It was formed by combining the 2007 TREC Public Spam Corpus and the Enron-Spam Dataset.
Code for combining and processing the two datasets: https://github.com/PuruSinghvi/Spam-Email-Classifier/blob/main/Combining%20Datasets.ipynb
A spam email classifier trained on this dataset can be found here: https://github.com/PuruSinghvi/Spam-Email-Classifier
CREATE TABLE combined_data (
"label" BIGINT,
"text" VARCHAR
);
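As a rough illustration of how the combined_data schema above might be queried, the following sketch builds the table in an in-memory SQLite database and counts rows per label. The sample rows are invented placeholders, not records from the dataset, and the encoding (1 for spam, 0 for not spam) is an assumption that this page does not confirm. The helper name label_counts is hypothetical.

```python
import sqlite3


def label_counts(conn):
    """Return a {label: row_count} dict for the combined_data table."""
    rows = conn.execute(
        'SELECT "label", COUNT(*) FROM combined_data '
        'GROUP BY "label" ORDER BY "label"'
    ).fetchall()
    return dict(rows)


# In-memory database using the same two-column schema as the listing.
conn = sqlite3.connect(":memory:")
conn.execute('CREATE TABLE combined_data ("label" BIGINT, "text" VARCHAR)')

# Hypothetical placeholder rows; assumes 1 = spam and 0 = not spam.
sample_rows = [
    (1, "win a free prize now"),
    (0, "meeting moved to 3pm"),
    (1, "cheap meds online"),
]
conn.executemany("INSERT INTO combined_data VALUES (?, ?)", sample_rows)

counts = label_counts(conn)
print(counts)
```

Run against the full CSV loaded into the same table, the per-label counts would show how balanced the two classes are, which is worth checking before training a classifier.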