Website Classification
@kaggle.hetulmehta_website_classification
Loading...
Loading...
Loading...
Loading...
@kaggle.hetulmehta_website_classification
This dataset was created by scraping different websites and then classifying them into different categories based on the extracted text.
Below are the values each column has. The column names are pretty self-explanatory.
website_url: URL link of the website.
cleaned_website_text: the cleaned text content extracted from the
@kaggle
@usgov
@usgov
Anyone who has the link will be able to view this.