Website Classification
Classify website URLs into different categories
@kaggle.hetulmehta_website_classification
This dataset was created by scraping different websites and then classifying them into different categories based on the extracted text.
The columns are described below; the column names are largely self-explanatory.
website_url: URL link of the website.
cleaned_website_text: the cleaned text content extracted from the
CREATE TABLE website_classification (
"unnamed_0" BIGINT, -- Unnamed: 0
"website_url" VARCHAR,
"cleaned_website_text" VARCHAR,
"category" VARCHAR
);
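As a rough illustration of how the schema above might be queried, the sketch below loads it into an in-memory SQLite database using Python's standard sqlite3 module and counts rows per category. The sample rows, the category labels ("Travel", "News"), and the helper names build_demo_db and category_counts are assumptions made up for this example; they are not taken from the dataset.

```python
import sqlite3

# Hypothetical sample rows for illustration only; not taken from the dataset.
SAMPLE_ROWS = [
    (0, "https://example.com/travel", "hotel booking flight deals", "Travel"),
    (1, "https://example.org/news", "latest headlines world politics", "News"),
    (2, "https://example.net/trips", "cheap holiday packages", "Travel"),
]


def build_demo_db():
    """Create an in-memory database with the documented table and sample rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        """
        CREATE TABLE website_classification (
            "unnamed_0" BIGINT,
            "website_url" VARCHAR,
            "cleaned_website_text" VARCHAR,
            "category" VARCHAR
        )
        """
    )
    conn.executemany(
        "INSERT INTO website_classification VALUES (?, ?, ?, ?)",
        SAMPLE_ROWS,
    )
    conn.commit()
    return conn


def category_counts(conn):
    """Return (category, row_count) pairs, most frequent category first."""
    query = (
        'SELECT "category", COUNT(*) AS n '
        "FROM website_classification "
        'GROUP BY "category" '
        'ORDER BY n DESC, "category"'
    )
    return conn.execute(query).fetchall()


if __name__ == "__main__":
    demo = build_demo_db()
    for category, n in category_counts(demo):
        print(f"{category}: {n}")
```

With the real data, the same GROUP BY query gives a quick view of how balanced the categories are before training a classifier on cleaned_website_text.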