Baselight

Company Ticker Tweets 2018-2023 NLP

Tweets, affecting companies from 2018 to 2023. Good for NLP researches.

@kaggle.h4t3h4k3rs_ticker_tweets_2018_2023

Loading...
Loading...

About this Dataset

Company Ticker Tweets 2018-2023 NLP

Dataset was collected on 23 April 2023.

Contains tweet, firstly filtered by likes count and company tickers.

Tweets dates: 2018-01-01 - 2023-01-01

Also added preprocessed and segmented versions.

Feel free to request other companies.

Tables

Tweets Merged

@kaggle.h4t3h4k3rs_ticker_tweets_2018_2023.tweets_merged
  • 12.16 MB
  • 104509 rows
  • 9 columns
Loading...

CREATE TABLE tweets_merged (
  "unnamed_0" BIGINT,
  "id" BIGINT,
  "date" DOUBLE,
  "text" VARCHAR,
  "likes" BIGINT,
  "retweets" BIGINT,
  "replies" BIGINT,
  "quotes" BIGINT,
  "ticker" VARCHAR
);

Tweets Preprocessed

@kaggle.h4t3h4k3rs_ticker_tweets_2018_2023.tweets_preprocessed
  • 8.26 MB
  • 104509 rows
  • 10 columns
Loading...

CREATE TABLE tweets_preprocessed (
  "unnamed_0" BIGINT,
  "id" BIGINT,
  "date" DOUBLE,
  "text" VARCHAR,
  "likes" BIGINT,
  "retweets" BIGINT,
  "replies" BIGINT,
  "quotes" BIGINT,
  "ticker" VARCHAR,
  "hashtag" VARCHAR
);

Tweets Segmented

@kaggle.h4t3h4k3rs_ticker_tweets_2018_2023.tweets_segmented
  • 14.45 MB
  • 104509 rows
  • 12 columns
Loading...

CREATE TABLE tweets_segmented (
  "unnamed_0" BIGINT,
  "id" BIGINT,
  "date" DOUBLE,
  "text" VARCHAR,
  "likes" BIGINT,
  "retweets" BIGINT,
  "replies" BIGINT,
  "quotes" BIGINT,
  "ticker" VARCHAR,
  "hashtag" VARCHAR,
  "segmentedtext" VARCHAR,
  "segmented" VARCHAR
);

Tweets Segmented Toned

@kaggle.h4t3h4k3rs_ticker_tweets_2018_2023.tweets_segmented_toned
  • 14.07 MB
  • 104509 rows
  • 12 columns
Loading...

CREATE TABLE tweets_segmented_toned (
  "id" BIGINT,
  "date" BIGINT,
  "text" VARCHAR,
  "likes" BIGINT,
  "retweets" BIGINT,
  "replies" BIGINT,
  "quotes" BIGINT,
  "ticker" VARCHAR,
  "hashtag" VARCHAR,
  "segmentedtext" VARCHAR,
  "segmented" VARCHAR,
  "tone" VARCHAR
);

Tweets Segmented Toned Priced

@kaggle.h4t3h4k3rs_ticker_tweets_2018_2023.tweets_segmented_toned_priced
  • 14.19 MB
  • 104509 rows
  • 13 columns
Loading...

CREATE TABLE tweets_segmented_toned_priced (
  "id" BIGINT,
  "date" BIGINT,
  "text" VARCHAR,
  "likes" BIGINT,
  "retweets" BIGINT,
  "replies" BIGINT,
  "quotes" BIGINT,
  "ticker" VARCHAR,
  "hashtag" VARCHAR,
  "segmentedtext" VARCHAR,
  "segmented" VARCHAR,
  "tone" VARCHAR,
  "price" DOUBLE
);

Share link

Anyone who has the link will be able to view this.