EPL Teams - Twitter Sentiment Dataset

Twitter Sentiment about Teams in the English Premier League

@kaggle.wjia26_epl_teams_twitter_sentiment_dataset

About this Dataset

Introduction

Contains tweets that mention any team in the English Premier League by its team hashtag, e.g. #Arsenal, #Chelsea.
Useful for analysing how particular events affect supporter behaviour and sentiment.
I've added an extra field called polarity, which holds the sentiment score computed from the text field.
Feel free to explore! Feedback is much appreciated!

Content

Each row represents a tweet. Tweet creation dates range from 12/07/2020 to 19/09/2020 (DD/MM/YYYY). The dataset will be updated monthly.

  • The team can be derived from the file_name field.
  • The date the tweet was created is in the created_at field.
  • The search query submitted to Twitter search is in the search_query field.
  • The full text of the tweet is in the text field.
  • The sentiment is in the polarity field. (I used the VADER model from NLTK to compute it.)
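
As a rough illustration of how these fields fit together, the sketch below averages polarity per team per day using only the Python standard library. The sample rows are made up, and two details are assumptions rather than documented facts about the dataset: that file_name is the team name plus a file extension (e.g. Arsenal.csv), and that created_at is formatted as YYYY-MM-DD HH:MM:SS. The helper names team_from_file_name and daily_mean_polarity are hypothetical. Adjust the parsing to match the real files.

```python
import csv
import io
from collections import defaultdict
from datetime import datetime
from pathlib import PurePath
from statistics import mean

# Made-up sample rows; the real file_name and created_at formats may differ.
SAMPLE_CSV = """file_name,created_at,search_query,text,polarity
Arsenal.csv,2020-07-12 10:15:00,#Arsenal,Great win today!,0.8
Arsenal.csv,2020-07-12 18:40:00,#Arsenal,Poor defending again,-0.4
Chelsea.csv,2020-07-12 09:05:00,#Chelsea,Solid performance,0.5
Chelsea.csv,2020-07-13 21:30:00,#Chelsea,Awful result,-0.6
"""


def team_from_file_name(file_name: str) -> str:
    """Assumption: the team name is the file name without its extension."""
    return PurePath(file_name).stem


def daily_mean_polarity(csv_file) -> dict:
    """Return {(team, date): mean polarity} for the rows in csv_file."""
    scores = defaultdict(list)
    for row in csv.DictReader(csv_file):
        team = team_from_file_name(row["file_name"])
        # Assumption: created_at looks like "YYYY-MM-DD HH:MM:SS".
        day = datetime.strptime(row["created_at"], "%Y-%m-%d %H:%M:%S").date()
        scores[(team, day)].append(float(row["polarity"]))
    return {key: mean(values) for key, values in scores.items()}


result = daily_mean_polarity(io.StringIO(SAMPLE_CSV))
for (team, day), score in sorted(result.items()):
    print(f"{team} {day} {score:+.2f}")
```

To run this against a downloaded file, pass an open file handle instead of the io.StringIO sample. Comparing the daily averages on either side of a match day or transfer announcement is one simple way to look at how an event shifted supporter sentiment.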

Acknowledgements

Thanks to the tweepy package for making data extraction via the Twitter API so easy.

Shameless Plug

Feel free to check out my blog if you want to learn how I built the data lake on AWS, or for other data shenanigans.

Here's an App I built using a live version of this data.
