Baselight

Twitter Sentiment Analysis And Word Embeddings

A dataset containing 940k+ tweets about Messi&Ronaldo, and GloVe+Sentiment140

@kaggle.ibrahimserouis99_twitter_sentiment_analysis_and_word_embeddings

Loading...
Loading...

About this Dataset

Twitter Sentiment Analysis And Word Embeddings

Content

TSA model v3

The third version of the Sentiment Analysis model trained in this notebook.

Dataset folder

  • GloVe embeddings
  • Sentiment140 Dataset (train+test)

Root path

  • Messi and Ronaldo tweets : raw-fetched tweets about Messi and Ronaldo
  • Cleaned Messi and Ronaldo tweets : the same tweets, but with their content cleaned (extra spaces and link removal, etc). Consult the Data cleaning notebook for more details about the process.
  • Predictions : inferences results on the aforementioned datasets. Learn more
  • Final dataset : geo-location retrieval and aggregations performed on the predictions result

Tables

Cleaned Messi Tweets

@kaggle.ibrahimserouis99_twitter_sentiment_analysis_and_word_embeddings.cleaned_messi_tweets
  • 37.28 MB
  • 465,223 rows
  • 10 columns
Loading...
CREATE TABLE cleaned_messi_tweets (
  "tweet_id" BIGINT,
  "author_id" BIGINT,
  "content" VARCHAR,
  "lang" VARCHAR,
  "date" VARCHAR,
  "source" VARCHAR,
  "geo" VARCHAR,
  "retweet_count" BIGINT,
  "like_count" BIGINT,
  "quote_count" BIGINT
);

Cleaned Ronaldo Tweets

@kaggle.ibrahimserouis99_twitter_sentiment_analysis_and_word_embeddings.cleaned_ronaldo_tweets
  • 41.39 MB
  • 501,926 rows
  • 10 columns
Loading...
CREATE TABLE cleaned_ronaldo_tweets (
  "tweet_id" BIGINT,
  "author_id" BIGINT,
  "content" VARCHAR,
  "lang" VARCHAR,
  "date" VARCHAR,
  "source" VARCHAR,
  "geo" VARCHAR,
  "retweet_count" DOUBLE,
  "like_count" DOUBLE,
  "quote_count" DOUBLE
);

Messi Final

@kaggle.ibrahimserouis99_twitter_sentiment_analysis_and_word_embeddings.messi_final
  • 39.04 MB
  • 435,163 rows
  • 12 columns
Loading...
CREATE TABLE messi_final (
  "tweet_id" BIGINT,
  "author_id" BIGINT,
  "content" VARCHAR,
  "lang" VARCHAR,
  "date" VARCHAR,
  "source" VARCHAR,
  "geo" VARCHAR,
  "retweet_count" BIGINT,
  "like_count" BIGINT,
  "quote_count" BIGINT,
  "prediction" DOUBLE,
  "label" VARCHAR
);

Messi Tweets

@kaggle.ibrahimserouis99_twitter_sentiment_analysis_and_word_embeddings.messi_tweets
  • 59.18 MB
  • 502,785 rows
  • 10 columns
Loading...
CREATE TABLE messi_tweets (
  "tweet_id" VARCHAR,
  "author_id" VARCHAR,
  "content" VARCHAR,
  "lang" VARCHAR,
  "date" VARCHAR,
  "source" VARCHAR,
  "geo" VARCHAR,
  "retweet_count" VARCHAR,
  "like_count" VARCHAR,
  "quote_count" VARCHAR
);

Predictions Messi

@kaggle.ibrahimserouis99_twitter_sentiment_analysis_and_word_embeddings.predictions_messi
  • 39.07 MB
  • 435,163 rows
  • 12 columns
Loading...
CREATE TABLE predictions_messi (
  "tweet_id" BIGINT,
  "author_id" BIGINT,
  "content" VARCHAR,
  "lang" VARCHAR,
  "date" VARCHAR,
  "source" VARCHAR,
  "geo" VARCHAR,
  "retweet_count" BIGINT,
  "like_count" BIGINT,
  "quote_count" BIGINT,
  "prediction" DOUBLE,
  "label" VARCHAR
);

Predictions Ronaldo

@kaggle.ibrahimserouis99_twitter_sentiment_analysis_and_word_embeddings.predictions_ronaldo
  • 42.99 MB
  • 471,846 rows
  • 12 columns
Loading...
CREATE TABLE predictions_ronaldo (
  "tweet_id" BIGINT,
  "author_id" BIGINT,
  "content" VARCHAR,
  "lang" VARCHAR,
  "date" VARCHAR,
  "source" VARCHAR,
  "geo" VARCHAR,
  "retweet_count" DOUBLE,
  "like_count" DOUBLE,
  "quote_count" DOUBLE,
  "prediction" DOUBLE,
  "label" VARCHAR
);

Ronaldo Final

@kaggle.ibrahimserouis99_twitter_sentiment_analysis_and_word_embeddings.ronaldo_final
  • 42.94 MB
  • 471,846 rows
  • 12 columns
Loading...
CREATE TABLE ronaldo_final (
  "tweet_id" BIGINT,
  "author_id" BIGINT,
  "content" VARCHAR,
  "lang" VARCHAR,
  "date" VARCHAR,
  "source" VARCHAR,
  "geo" VARCHAR,
  "retweet_count" DOUBLE,
  "like_count" DOUBLE,
  "quote_count" DOUBLE,
  "prediction" DOUBLE,
  "label" VARCHAR
);

Ronaldo Tweets

@kaggle.ibrahimserouis99_twitter_sentiment_analysis_and_word_embeddings.ronaldo_tweets
  • 60.78 MB
  • 502,635 rows
  • 10 columns
Loading...
CREATE TABLE ronaldo_tweets (
  "tweet_id" VARCHAR,
  "author_id" VARCHAR,
  "content" VARCHAR,
  "lang" VARCHAR,
  "date" VARCHAR,
  "source" VARCHAR,
  "geo" VARCHAR,
  "retweet_count" VARCHAR,
  "like_count" VARCHAR,
  "quote_count" VARCHAR
);

Share link

Anyone who has the link will be able to view this.