Baselight

Cleaned Toxic Comments

Preprocessed data for Toxic Comments Classification Challenge

@kaggle.fizzbuzz_cleaned_toxic_comments

About this Dataset

Cleaned Toxic Comments

Preporcessed Toxic Comments Classification Dataset

The obstacle I faced in Toxic Comments Classification Challenge was the preprocessing part. One can easily improve their LB performance if the preprocessing is done right.

This is the preprocessed version of Toxic Comments Classification Challenge dataset. The code for preprocessing: https://www.kaggle.com/fizzbuzz/toxic-data-preprocessing

Share link

Anyone who has the link will be able to view this.