Name: Code-Mixed Sentences Dataset (Hinglish)
Creator: Kaggle
License: http://opendatacommons.org/licenses/odbl/1.0/

Catagorised the sentences in to cyberbullying (1) or Not Cyberbullying (0)

This dataset consists of 25,000 code-mixed Hindi-English (Hinglish) text samples, created to support the development and evaluation of machine learning models for cyberbullying detection. The dataset reflects the informal, Roman-script nature of Hinglish as used on social media, messaging platforms, and online forums.

Related Datasets

SentMix-3L

@kaggle
Dummy Monster

@owid
Individuals - Encountering Hostile Or Degrading Online Messages

@eurostat
Social Media Ban For Minors: A Computational Analysis Of Media Coverage In Europe And Beyond, Dataset

@ecjrc
School By Language

@ukgov
Teen Mental Health By Risk Factor (2015–2023)

@kidscount

SentMix-3L

Dummy Monster

Individuals - Encountering Hostile Or Degrading Online Messages

Social Media Ban For Minors: A Computational Analysis Of Media Coverage In Europe And Beyond, Dataset

School By Language

Teen Mental Health By Risk Factor (2015–2023)