Code-Mixed Sentences Dataset (Hinglish)
Catagorised the sentences in to cyberbullying (1) or Not Cyberbullying (0)
@kaggle.pankaazshah_code_mixed_text_dataset_hinglish
Catagorised the sentences in to cyberbullying (1) or Not Cyberbullying (0)
@kaggle.pankaazshah_code_mixed_text_dataset_hinglish
This dataset consists of 25,000 code-mixed Hindi-English (Hinglish) text samples, created to support the development and evaluation of machine learning models for cyberbullying detection. The dataset reflects the informal, Roman-script nature of Hinglish as used on social media, messaging platforms, and online forums.
CREATE TABLE hinglish_cyberbullying_dataset_25000 (
"id" BIGINT,
"text" VARCHAR,
"label" BIGINT
);
Anyone who has the link will be able to view this.