Baselight
Sign In
kaggle

Code-Mixed Sentences Dataset (Hinglish)

Kaggle

@kaggle.pankaazshah_code_mixed_text_dataset_hinglish

Loading...
Loading...

Catagorised the sentences in to cyberbullying (1) or Not Cyberbullying (0)

Dataset Description

This dataset consists of 25,000 code-mixed Hindi-English (Hinglish) text samples, created to support the development and evaluation of machine learning models for cyberbullying detection. The dataset reflects the informal, Roman-script nature of Hinglish as used on social media, messaging platforms, and online forums.


Related Datasets

Share link

Anyone who has the link will be able to view this.