Baselight

Analytics Vidhya - Identify The Sentiments

Evaluation Metric: Weighted F1 Score

@kaggle.anmolkumar_analytics_vidhya_identify_the_sentiments

About this Dataset

Analytics Vidhya - Identify The Sentiments

Analytics Vidhya - Identify the Sentiments

Sentiment analysis remains one of the key problems that has seen extensive application of natural language processing. This time around, given the tweets from customers about various tech firms who manufacture and sell mobiles, computers, laptops, etc, the task is to identify if the tweets have a negative sentiment towards such companies or products.

Evaluation Metric

The metric used for evaluating the performance of classification model would be weighted F1-Score.

Public and Private Split

Note that the test data is further randomly divided into Public (35%) and Private (65%) data. Your initial responses will be checked and scored on the Public data. The final rankings would be based on your private score which will be published once the competition is over.

Data

  • train.csv - For training the models, we provide a labelled dataset of 7920 tweets. The dataset is provided in the form of a csv file with each line storing a tweet id, its label and the tweet.

  • test.csv - The test data file contains only tweet ids and the tweet text with each tweet in a new line.

  • sample_submission.csv - The exact format for a valid submission

Most profane and vulgar terms in the tweets have been replaced with “$&@*#”. However, please note that the dataset still might contain text that may be considered profane, vulgar, or offensive.

Share link

Anyone who has the link will be able to view this.