Dataset: Emo (Infer Underlying Emotion)

About this Dataset

Emo (Infer Underlying Emotion)

Textual dialogue along with previous context. Goal: Infer Underlying Emotions

By Huggingface Hub [source]

About this dataset

More Datasets

For more datasets, click here.

Featured Notebooks

🚨 Your notebook can be here! 🚨!

How to use the dataset

The dataset consists of three csv files: train.csv, test.csv and validation.csv, each containing text and labels for psychological conversations between two people, allowing for the identification of underlying emotions of each utterance. This is perfect for gaining insights into complex relationships between emotions and textual conversations. The data includes the full dialogue between two people, along with two previous turns of context and a label for each turn which is either Happy, Sad, Angry or Others.

Here are steps on how you could use this data set:

Read in and explore the data – First off begin by reading in your CSV file or files if using both training and test sets into python such as Pandas to quickly familiarize yourself with your data set; looking at its size (number of columns/rows), datatypes (str/int), missing values etc . This will give you better understanding when manipulating it later on down the line

Data preprocessing – Here we want to clean up our text by removing punctuation & other unnecessary characters; normalizing word lengths; parsing out whats being said from who’s saying it using machine learning algorithms like Natural Language Understanding(NLU); & lastly converting it all into a numerical format so that our model can understand it better

Modelling – Now that our Neural Network has been properly trained we can begin applying various machine learning approaches such as deep Artificial Neural Networks (ANNs), Convolutional Neural Networks (CNNs) , Long Short-Term Memory networks (LSTMs) & many more! Given their complexity these models take in large amounts of data which makes them suitable well suited considering that they are quite complex while taking in very large amounts of information but have been proven successful at automatically extracting patterns from raw textual dialogues similar to what is found within this dataset

Evaluation – No matter which approach was taken there should always be a way of evaluating results whether via manual inspection or quantitative metrics such as accuracy scores etc..Inspecting numerous examples manually allows you to become familiar with errors that might occur i therefore build up an intuition around what type errors needs attention first if any

Once completed these 4 steps provide us enough information

Research Ideas

Developing automatic dialogue systems that can recognize the feelings and emotions expressed by conversations

Building a sentiment analysis tool to extract emotions from conversations

Training machine learning models to predict the emotion of each utterance in a conversation

Acknowledgements

If you use this dataset in your research, please credit the original authors.

Data Source

License

License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

Columns

File: train.csv

Column name	Description
text	The text of the conversation between two people. (String)
label	The emotion associated with each turn of dialogue (Happy, Sad, Angry or Others). (String)

File: test.csv

Column name	Description
text	The text of the conversation between two people. (String)
label	The emotion associated with each turn of dialogue (Happy, Sad, Angry or Others). (String)

Acknowledgements

If you use this dataset in your research, please credit the original authors.
If you use this dataset in your research, please credit Huggingface Hub.

Tables

Test

@kaggle.thedevastator_uncover_the_emotions_behind_textual_dialogues.test

236.89 KB
5509 rows
2 columns


CREATE TABLE test (
  "text" VARCHAR,
  "label" BIGINT
);

Train

@kaggle.thedevastator_uncover_the_emotions_behind_textual_dialogues.train

1.27 MB
30160 rows
2 columns


CREATE TABLE train (
  "text" VARCHAR,
  "label" BIGINT
);