Baselight

NFL Twitter Sentiment Analysis

Labelled NFL-related tweets | sentiment analysis | scraped from Twitter

@kaggle.cammyc_nfl_twitter_sentiment_analysis

About this Dataset

NFL Team Sentiment Analysis Dataset

Overview

This dataset contains sentiment-labeled social media posts about NFL teams, specifically focused on tracking fan reactions and public sentiment. Each entry has been processed through a fine-tuned RoBERTa model with custom sports context modifiers to ensure accurate sentiment classification in the context of NFL-specific language and events.

The data was scraped with a Selenium web scraper.
📝 View Code on GitHub

Dataset Description

Features

  • username: Author of the post
  • timestamp: Time of post creation (format: YYYY-MM-DDThh:mm:ss.000Z)
  • text: Content of the post
  • team: NFL team referenced in the post
  • sentiment: Classified sentiment (positive, negative, or neutral)
  • confidence: Model's confidence score for the sentiment classification (0-1)
  • roberta_raw_outputs: Raw class probabilities from the model, in the order [negative, neutral, positive]
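
The timestamp and roberta_raw_outputs fields arrive as text, so they usually need parsing before analysis. The following is a minimal sketch, not the author's pipeline. It assumes that roberta_raw_outputs is stored as a bracketed list string such as "[0.1, 0.2, 0.7]" and that the class order matches the feature list above. The helper names parse_timestamp and label_from_raw are hypothetical.

```python
import ast
from datetime import datetime, timezone

# Class order as given in the feature list: [negative, neutral, positive].
LABELS = ("negative", "neutral", "positive")


def parse_timestamp(value: str) -> datetime:
    """Parse the documented YYYY-MM-DDThh:mm:ss.000Z format as UTC."""
    parsed = datetime.strptime(value, "%Y-%m-%dT%H:%M:%S.%fZ")
    return parsed.replace(tzinfo=timezone.utc)


def label_from_raw(raw: str) -> tuple[str, float]:
    """Return (label, probability) for the highest-scoring class.

    Assumption: `raw` is a string holding a Python-style list of three numbers.
    """
    probs = [float(p) for p in ast.literal_eval(raw)]
    if len(probs) != len(LABELS):
        raise ValueError(f"expected {len(LABELS)} probabilities, got {len(probs)}")
    best = max(range(len(probs)), key=probs.__getitem__)
    return LABELS[best], probs[best]
```

For example, label_from_raw("[0.1, 0.2, 0.7]") returns ("positive", 0.7). Because the dataset applies custom sports context modifiers, the stored sentiment and confidence values may differ from this plain argmax over the raw outputs.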

Key Statistics

  • Total Records: 3701 labeled posts
  • Time Period: November 2024
  • Teams Covered: Denver Broncos, Indianapolis Colts, Cleveland Browns, Jacksonville Jaguars, Kansas City Chiefs, Detroit Lions

Collection Methodology

  • Data sourced from public posts on Twitter
  • Processed through a RoBERTa model fine-tuned for NFL context
  • Custom sports-specific context modifiers applied for improved accuracy
  • Automated cleaning to remove sales-related posts, media-only content, and irrelevant material

Potential Applications

  • Fan sentiment analysis during NFL season
  • Team performance impact on public opinion
  • Temporal analysis of fan reactions
  • Sports marketing research
  • Social media engagement studies
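
As a starting point for the fan sentiment use cases above, the share of each sentiment label per team can be computed with the standard library alone. This is an illustrative sketch: the function name sentiment_share_by_team is hypothetical, and sample_rows is made-up data shaped like the documented team and sentiment fields, not rows from the dataset.

```python
from collections import Counter, defaultdict


def sentiment_share_by_team(rows):
    """Map each team to the fraction of its posts carrying each sentiment label."""
    counts = defaultdict(Counter)
    for row in rows:
        counts[row["team"]][row["sentiment"]] += 1

    shares = {}
    for team, label_counts in counts.items():
        total = sum(label_counts.values())
        shares[team] = {label: n / total for label, n in label_counts.items()}
    return shares


# Made-up rows for illustration only; real rows would come from the table.
sample_rows = [
    {"team": "Detroit Lions", "sentiment": "positive"},
    {"team": "Detroit Lions", "sentiment": "positive"},
    {"team": "Detroit Lions", "sentiment": "negative"},
    {"team": "Cleveland Browns", "sentiment": "neutral"},
]

shares = sentiment_share_by_team(sample_rows)
```

Grouping on a date derived from timestamp instead of on team would give the same breakdown over time.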

Limitations

The dataset does not yet cover all NFL teams. It will be updated over time, and I will remove this note once coverage is complete.

License

MIT

Note: This dataset contains publicly available social media posts. Users should comply with platform terms of service and privacy considerations.

Tables

Nfl Sentiments

@kaggle.cammyc_nfl_twitter_sentiment_analysis.nfl_sentiments
  • 778.07 kB
  • 5,171 rows
  • 16 columns
CREATE TABLE nfl_sentiments (
  "username" VARCHAR,
  "timestamp" VARCHAR,
  "text" VARCHAR,
  "team" VARCHAR,
  "sentiment" VARCHAR,
  "confidence" VARCHAR,
  "roberta_raw_outputs" VARCHAR,
  "unnamed_7" VARCHAR,  -- Unnamed: 7
  "unnamed_8" VARCHAR,  -- Unnamed: 8
  "unnamed_9" VARCHAR,  -- Unnamed: 9
  "unnamed_10" VARCHAR,  -- Unnamed: 10
  "unnamed_11" VARCHAR,  -- Unnamed: 11
  "unnamed_12" VARCHAR,  -- Unnamed: 12
  "unnamed_13" VARCHAR,  -- Unnamed: 13
  "unnamed_14" DOUBLE,  -- Unnamed: 14
  "unnamed_15" VARCHAR  -- Unnamed: 15
);
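
The schema has nine unnamed_* columns that the feature list does not describe, and it stores confidence as text. The page does not explain either point. A cautious first step is to keep only the seven documented columns and drop rows whose confidence is not a number between 0 and 1. The sketch below assumes rows are available as dictionaries keyed by column name (for example from csv.DictReader). The names DOCUMENTED_COLUMNS and clean_row are hypothetical.

```python
# The seven columns described in the feature list; the unnamed_* columns are dropped.
DOCUMENTED_COLUMNS = (
    "username",
    "timestamp",
    "text",
    "team",
    "sentiment",
    "confidence",
    "roberta_raw_outputs",
)


def clean_row(row):
    """Return the row restricted to documented columns, or None if unusable.

    A row is treated as unusable when `confidence` is missing, is not numeric,
    or falls outside the documented 0-1 range.
    """
    try:
        confidence = float(row.get("confidence") or "")
    except ValueError:
        return None
    if not 0.0 <= confidence <= 1.0:
        return None

    cleaned = {name: row.get(name) for name in DOCUMENTED_COLUMNS}
    cleaned["confidence"] = confidence  # store as a float rather than text
    return cleaned
```

Whether the rejected rows are genuinely malformed is an assumption to verify against the data, so it is worth counting how many rows this filter removes before relying on it.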
