Code-Mixed Sentences Dataset (Hinglish)
@kaggle.pankaazshah_code_mixed_text_dataset_hinglish
This dataset consists of 25,000 code-mixed Hindi-English (Hinglish) text samples, created to support the development and evaluation of machine learning models for cyberbullying detection. The dataset reflects the informal, Roman-script nature of Hinglish as used on social media, messaging platforms, and online forums.