Twitter Sentiment Analysis And Word Embeddings

A dataset of 940k+ tweets about Messi and Ronaldo, bundled with GloVe embeddings and the Sentiment140 dataset

@kaggle.ibrahimserouis99_twitter_sentiment_analysis_and_word_embeddings

About this Dataset

Content

TSA model v3

The third version of the sentiment analysis model, trained in the accompanying notebook.

Dataset folder

  • GloVe embeddings
  • Sentiment140 Dataset (train+test)

Root path

  • Messi and Ronaldo tweets: the raw tweets about Messi and Ronaldo, as fetched
  • Cleaned Messi and Ronaldo tweets: the same tweets with their content cleaned (extra spaces and links removed, etc.). See the Data cleaning notebook for details of the process.
  • Predictions: inference results on the aforementioned datasets
  • Final dataset: geo-location retrieval and aggregations performed on the prediction results
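
The cleaning step described above (removing links and extra spaces) could look roughly like the sketch below. It is not the code from the Data cleaning notebook: the function name `clean_tweet` and both regular expressions are assumptions made for illustration, and the real process may apply further rules.

```python
import re

# Assumption: a "link" is anything starting with http://, https:// or www.
# The actual notebook may use a different definition.
URL_PATTERN = re.compile(r"(?:https?://|www\.)\S+")
WHITESPACE_PATTERN = re.compile(r"\s+")


def clean_tweet(text: str) -> str:
    """Remove links, then collapse runs of whitespace into single spaces."""
    without_links = URL_PATTERN.sub(" ", text)
    return WHITESPACE_PATTERN.sub(" ", without_links).strip()


if __name__ == "__main__":
    print(clean_tweet("Messi   scores!  https://t.co/abc123 \n wow"))
```

Links are replaced with a space rather than an empty string so that the words on either side of a link do not get glued together; the whitespace pass then tidies up the result.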

Tables

Cleaned Messi Tweets

@kaggle.ibrahimserouis99_twitter_sentiment_analysis_and_word_embeddings.cleaned_messi_tweets
  • 35.56 MB
  • 465223 rows
  • 10 columns

CREATE TABLE cleaned_messi_tweets (
  "tweet_id" BIGINT,
  "author_id" BIGINT,
  "content" VARCHAR,
  "lang" VARCHAR,
  "date" VARCHAR,
  "source" VARCHAR,
  "geo" VARCHAR,
  "retweet_count" BIGINT,
  "like_count" BIGINT,
  "quote_count" BIGINT
);

Cleaned Ronaldo Tweets

@kaggle.ibrahimserouis99_twitter_sentiment_analysis_and_word_embeddings.cleaned_ronaldo_tweets
  • 39.47 MB
  • 501926 rows
  • 10 columns

CREATE TABLE cleaned_ronaldo_tweets (
  "tweet_id" BIGINT,
  "author_id" BIGINT,
  "content" VARCHAR,
  "lang" VARCHAR,
  "date" VARCHAR,
  "source" VARCHAR,
  "geo" VARCHAR,
  "retweet_count" DOUBLE,
  "like_count" DOUBLE,
  "quote_count" DOUBLE
);
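
Note that the engagement counts (retweet_count, like_count, quote_count) are typed BIGINT in cleaned_messi_tweets but DOUBLE in cleaned_ronaldo_tweets. When combining the two tables, it may help to coerce the counts to a single integer type first. The helper below is a minimal sketch of one way to do that; the name `to_count` is hypothetical, and treating NaN or empty values as missing is an assumption about the data, not something the dataset description states.

```python
import math
from typing import Optional, Union


def to_count(value: Union[str, float, int, None]) -> Optional[int]:
    """Coerce a count stored as text, float, or int to an int.

    Returns None for missing values (None, empty string, NaN).
    Raises ValueError if a string is not numeric.
    """
    if value is None:
        return None
    if isinstance(value, str):
        value = value.strip()
        if not value:
            return None
        value = float(value)
    if math.isnan(value):
        return None
    return int(value)


if __name__ == "__main__":
    print(to_count(3.0), to_count("12.0"), to_count(float("nan")))
```

This is meant for the count columns only. Going through a float would not be safe for tweet_id or author_id, because very large identifiers can lose precision as floating-point values.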

Messi Final

@kaggle.ibrahimserouis99_twitter_sentiment_analysis_and_word_embeddings.messi_final
  • 37.23 MB
  • 435163 rows
  • 12 columns

CREATE TABLE messi_final (
  "tweet_id" BIGINT,
  "author_id" BIGINT,
  "content" VARCHAR,
  "lang" VARCHAR,
  "date" VARCHAR,
  "source" VARCHAR,
  "geo" VARCHAR,
  "retweet_count" BIGINT,
  "like_count" BIGINT,
  "quote_count" BIGINT,
  "prediction" DOUBLE,
  "label" VARCHAR
);
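
The prediction and label columns lend themselves to simple per-label summaries. The sketch below counts rows and averages the prediction score for each label. It assumes rows are available as dictionaries keyed by column name; the function name `summarize_by_label` and the label values in the example are hypothetical, since the dataset description does not list the actual label values or the aggregations used to build the final tables.

```python
from collections import defaultdict
from typing import Dict, Iterable, Mapping


def summarize_by_label(rows: Iterable[Mapping[str, object]]) -> Dict[str, Dict[str, float]]:
    """Return the row count and mean prediction score for each label."""
    totals = defaultdict(lambda: {"count": 0, "sum": 0.0})
    for row in rows:
        bucket = totals[str(row["label"])]
        bucket["count"] += 1
        bucket["sum"] += float(row["prediction"])
    return {
        label: {
            "count": bucket["count"],
            "mean_prediction": bucket["sum"] / bucket["count"],
        }
        for label, bucket in totals.items()
    }


if __name__ == "__main__":
    # Made-up rows for illustration only; not taken from the dataset.
    sample = [
        {"label": "positive", "prediction": 0.75},
        {"label": "positive", "prediction": 0.25},
        {"label": "negative", "prediction": 0.5},
    ]
    print(summarize_by_label(sample))
```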

Messi Tweets

@kaggle.ibrahimserouis99_twitter_sentiment_analysis_and_word_embeddings.messi_tweets
  • 56.44 MB
  • 502785 rows
  • 10 columns

CREATE TABLE messi_tweets (
  "tweet_id" VARCHAR,
  "author_id" VARCHAR,
  "content" VARCHAR,
  "lang" VARCHAR,
  "date" VARCHAR,
  "source" VARCHAR,
  "geo" VARCHAR,
  "retweet_count" VARCHAR,
  "like_count" VARCHAR,
  "quote_count" VARCHAR
);

Predictions Messi

@kaggle.ibrahimserouis99_twitter_sentiment_analysis_and_word_embeddings.predictions_messi
  • 37.26 MB
  • 435163 rows
  • 12 columns

CREATE TABLE predictions_messi (
  "tweet_id" BIGINT,
  "author_id" BIGINT,
  "content" VARCHAR,
  "lang" VARCHAR,
  "date" VARCHAR,
  "source" VARCHAR,
  "geo" VARCHAR,
  "retweet_count" BIGINT,
  "like_count" BIGINT,
  "quote_count" BIGINT,
  "prediction" DOUBLE,
  "label" VARCHAR
);

Predictions Ronaldo

@kaggle.ibrahimserouis99_twitter_sentiment_analysis_and_word_embeddings.predictions_ronaldo
  • 41 MB
  • 471846 rows
  • 12 columns

CREATE TABLE predictions_ronaldo (
  "tweet_id" BIGINT,
  "author_id" BIGINT,
  "content" VARCHAR,
  "lang" VARCHAR,
  "date" VARCHAR,
  "source" VARCHAR,
  "geo" VARCHAR,
  "retweet_count" DOUBLE,
  "like_count" DOUBLE,
  "quote_count" DOUBLE,
  "prediction" DOUBLE,
  "label" VARCHAR
);

Ronaldo Final

@kaggle.ibrahimserouis99_twitter_sentiment_analysis_and_word_embeddings.ronaldo_final
  • 40.95 MB
  • 471846 rows
  • 12 columns

CREATE TABLE ronaldo_final (
  "tweet_id" BIGINT,
  "author_id" BIGINT,
  "content" VARCHAR,
  "lang" VARCHAR,
  "date" VARCHAR,
  "source" VARCHAR,
  "geo" VARCHAR,
  "retweet_count" DOUBLE,
  "like_count" DOUBLE,
  "quote_count" DOUBLE,
  "prediction" DOUBLE,
  "label" VARCHAR
);

Ronaldo Tweets

@kaggle.ibrahimserouis99_twitter_sentiment_analysis_and_word_embeddings.ronaldo_tweets
  • 57.96 MB
  • 502635 rows
  • 10 columns

CREATE TABLE ronaldo_tweets (
  "tweet_id" VARCHAR,
  "author_id" VARCHAR,
  "content" VARCHAR,
  "lang" VARCHAR,
  "date" VARCHAR,
  "source" VARCHAR,
  "geo" VARCHAR,
  "retweet_count" VARCHAR,
  "like_count" VARCHAR,
  "quote_count" VARCHAR
);
