Baselight

PhiUSIIL Phishing URLs

134,850 legitimate and 100,945 phishing URLs

@kaggle.joebeachcapital_phiusiil_phishing_url


About this Dataset

The PhiUSIIL Phishing URL Dataset comprises 134,850 legitimate and 100,945 phishing URLs (235,795 in total). Most of the URLs analyzed while constructing the dataset were recent at the time of collection. Features are extracted from the URL itself and from the source code of the corresponding webpage. Some features, such as CharContinuationRate, URLTitleMatchScore, URLCharProb, and TLDLegitimateProb, are derived from other, existing features.
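To make the idea of URL-level features concrete, here is a minimal Python sketch that computes a few simple lexical features from a URL string. The dictionary keys mirror column names from the table schema on this page, but the counting rules are assumptions: this page does not say, for example, whether the scheme is included in the length, so the function name basic_url_features and its logic are illustrative only and may not reproduce the dataset's values.

```python
from urllib.parse import urlparse


def basic_url_features(url: str) -> dict:
    """Compute a few simple lexical features from a URL string.

    Assumption: all counts are taken over the full URL string, scheme
    included. The dataset authors may count differently.
    """
    length = len(url)
    letters = sum(ch.isalpha() for ch in url)
    digits = sum(ch.isdigit() for ch in url)
    return {
        "urllength": length,
        "nooflettersinurl": letters,
        "letterratioinurl": letters / length if length else 0.0,
        "noofdegitsinurl": digits,  # spelling follows the dataset column
        "degitratioinurl": digits / length if length else 0.0,
        "noofequalsinurl": url.count("="),
        "noofqmarkinurl": url.count("?"),
        "noofampersandinurl": url.count("&"),
        "ishttps": int(urlparse(url).scheme.lower() == "https"),
    }


# Hypothetical example URL, not taken from the dataset.
features = basic_url_features("https://example.com/login?id=42&next=home")
for name, value in features.items():
    print(f"{name}: {value}")
```

Derived features such as CharContinuationRate or URLCharProb are deliberately left out, since their definitions are given in the introductory paper rather than on this page.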

Introductory Paper

PhiUSIIL: A diverse security profile empowered phishing URL detection framework based on similarity index and incremental learning
By Arvind Prasad and Shalini Chandra. 2024

Published in Computers & Security

Additional Variable Information

The "FILENAME" column (stored as "filename" in the table below) can be ignored.

Class Labels

Label 1 corresponds to a legitimate URL, and label 0 to a phishing URL.
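Because the legitimate class is coded as 1 here, which may be the opposite of what some readers expect, it can help to make the mapping explicit in code. The short Python sketch below uses only the class counts stated on this page and checks that they add up to the row count listed for the table; the names LABEL_NAMES, CLASS_COUNTS, and label_name are illustrative, not part of the dataset.

```python
# Label coding as stated on this page: 1 = legitimate, 0 = phishing.
LABEL_NAMES = {1: "legitimate", 0: "phishing"}

# Class counts as stated in the dataset description.
CLASS_COUNTS = {1: 134_850, 0: 100_945}

total = sum(CLASS_COUNTS.values())
shares = {LABEL_NAMES[label]: count / total for label, count in CLASS_COUNTS.items()}


def label_name(value: int) -> str:
    """Translate a numeric label into a readable class name."""
    return LABEL_NAMES[value]


# The two classes should account for every row in the table.
assert total == 235_795

for name, share in shares.items():
    print(f"{name}: {share:.1%}")
```

The classes are moderately imbalanced in favor of legitimate URLs, which is worth keeping in mind when choosing evaluation metrics.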

Tables

PhiUSIIL Phishing URL Dataset

@kaggle.joebeachcapital_phiusiil_phishing_url.phiusiil_phishing_url_dataset
  • 23.76 MB
  • 235795 rows
  • 56 columns

CREATE TABLE phiusiil_phishing_url_dataset (
  "filename" VARCHAR,
  "url" VARCHAR,
  "urllength" BIGINT,
  "domain" VARCHAR,
  "domainlength" BIGINT,
  "isdomainip" BIGINT,
  "tld" VARCHAR,
  "urlsimilarityindex" DOUBLE,
  "charcontinuationrate" DOUBLE,
  "tldlegitimateprob" DOUBLE,
  "urlcharprob" DOUBLE,
  "tldlength" BIGINT,
  "noofsubdomain" BIGINT,
  "hasobfuscation" BIGINT,
  "noofobfuscatedchar" BIGINT,
  "obfuscationratio" DOUBLE,
  "nooflettersinurl" BIGINT,
  "letterratioinurl" DOUBLE,
  "noofdegitsinurl" BIGINT,
  "degitratioinurl" DOUBLE,
  "noofequalsinurl" BIGINT,
  "noofqmarkinurl" BIGINT,
  "noofampersandinurl" BIGINT,
  "noofotherspecialcharsinurl" BIGINT,
  "spacialcharratioinurl" DOUBLE,
  "ishttps" BIGINT,
  "lineofcode" BIGINT,
  "largestlinelength" BIGINT,
  "hastitle" BIGINT,
  "title" VARCHAR,
  "domaintitlematchscore" DOUBLE,
  "urltitlematchscore" DOUBLE,
  "hasfavicon" BIGINT,
  "robots" BIGINT,
  "isresponsive" BIGINT,
  "noofurlredirect" BIGINT,
  "noofselfredirect" BIGINT,
  "hasdescription" BIGINT,
  "noofpopup" BIGINT,
  "noofiframe" BIGINT,
  "hasexternalformsubmit" BIGINT,
  "hassocialnet" BIGINT,
  "hassubmitbutton" BIGINT,
  "hashiddenfields" BIGINT,
  "haspasswordfield" BIGINT,
  "bank" BIGINT,
  "pay" BIGINT,
  "crypto" BIGINT,
  "hascopyrightinfo" BIGINT,
  "noofimage" BIGINT,
  "noofcss" BIGINT,
  "noofjs" BIGINT,
  "noofselfref" BIGINT,
  "noofemptyref" BIGINT,
  "noofexternalref" BIGINT,
  "label" BIGINT
);
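As a rough illustration of how a table with this schema might be queried, the Python sketch below builds a reduced copy of it (four of the 56 columns) in an in-memory SQLite database and summarizes it by label. SQLite is a stand-in chosen only so the example is self-contained; it is not necessarily the engine behind this page, and the three rows are hypothetical placeholders, not records from the dataset.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Reduced version of the schema above: only four of the 56 columns.
conn.execute(
    "CREATE TABLE phiusiil_phishing_url_dataset ("
    '"url" VARCHAR, "urllength" BIGINT, "ishttps" BIGINT, "label" BIGINT)'
)

# Hypothetical placeholder URLs with made-up labels (1 = legitimate, 0 = phishing).
placeholder_urls = [
    ("https://www.example.org", 1),
    ("https://www.example.com", 1),
    ("http://example.invalid/login", 0),
]
rows = [
    (url, len(url), int(url.startswith("https://")), label)
    for url, label in placeholder_urls
]
conn.executemany(
    "INSERT INTO phiusiil_phishing_url_dataset VALUES (?, ?, ?, ?)", rows
)

# Row count and share of HTTPS URLs per class.
summary = conn.execute(
    'SELECT "label", COUNT(*), AVG("ishttps") '
    'FROM phiusiil_phishing_url_dataset GROUP BY "label" ORDER BY "label"'
).fetchall()

for label, n_rows, https_share in summary:
    print(f"label={label} rows={n_rows} https_share={https_share:.2f}")

conn.close()
```

The same GROUP BY pattern should carry over to the full table for any of the binary columns, though the exact SQL dialect may differ.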
