Evaluation Metric: Weighted F1 Score

Analytics Vidhya - Identify the Sentiments

Sentiment analysis remains one of the key problems that has seen extensive application of natural language processing. This time around, given the tweets from customers about various tech firms who manufacture and sell mobiles, computers, laptops, etc, the task is to identify if the tweets have a negative sentiment towards such companies or products.

Evaluation Metric

The metric used for evaluating the performance of classification model would be weighted F1-Score.

Public and Private Split

Note that the test data is further randomly divided into Public (35%) and Private (65%) data. Your initial responses will be checked and scored on the Public data. The final rankings would be based on your private score which will be published once the competition is over.

Data

train.csv - For training the models, we provide a labelled dataset of 7920 tweets. The dataset is provided in the form of a csv file with each line storing a tweet id, its label and the tweet.
test.csv - The test data file contains only tweet ids and the tweet text with each tweet in a new line.
sample_submission.csv - The exact format for a valid submission

Most profane and vulgar terms in the tweets have been replaced with “$&@*#”. However, please note that the dataset still might contain text that may be considered profane, vulgar, or offensive.

Related Datasets

F1 Drivers Dataset

@kaggle
SFC2014 - REACT EU Overview Allocation Vs Decided

@esifunds
Trust Questions In The European Social Survey, Latinobarómetro And Afrobarometer

@owid
Ethnic Power Relations Dataset (ETH, 2021)

@owid
Global Forest Resources Assessment

@owid
Trust – Eurostat

@owid

F1 Drivers Dataset

SFC2014 - REACT EU Overview Allocation Vs Decided

Trust Questions In The European Social Survey, Latinobarómetro And Afrobarometer

Ethnic Power Relations Dataset (ETH, 2021)

Global Forest Resources Assessment

Trust – Eurostat