Classification of a URL if spam or not spam

URL - Spam or Not Spam - Classification Dataset

This dataset contains about 87.5K URLs in which one-third are flagged as a spam URL and restrict are not spam. It can be used to create a binary classification model.

Credits:

The dataset was created by The Pudding. This dataset of every link is found in different newsletters. The flagging system identifies if a link is a spam or not, as it parses links from over 100 newsletters every 30 minutes. A link is programatically f flagged if it appears 3+ times in a single newsletter or contains a likely subscribe/unsubscribe URL. If you use this dataset, don't forget to cite the author.

Related Datasets

Spam Email Dataset

@kaggle
Fur Banning

@owid
Nuclear Weapons Proliferation

@owid
Ethnic Power Relations Dataset (ETH, 2021)

@owid
2020 PREDICT Dataset (deprecated)

@ecjrc
TGS SC2 Nasal Positivity

@cdc

Spam Email Dataset

Fur Banning

Nuclear Weapons Proliferation

Ethnic Power Relations Dataset (ETH, 2021)

2020 PREDICT Dataset (deprecated)

TGS SC2 Nasal Positivity