Company Ticker Tweets 2018-2023 NLP
Tweets, affecting companies from 2018 to 2023. Good for NLP researches.
@kaggle.h4t3h4k3rs_ticker_tweets_2018_2023
Tweets, affecting companies from 2018 to 2023. Good for NLP researches.
@kaggle.h4t3h4k3rs_ticker_tweets_2018_2023
Dataset was collected on 23 April 2023.
Contains tweet, firstly filtered by likes count and company tickers.
Tweets dates: 2018-01-01 - 2023-01-01
Also added preprocessed and segmented versions.
Feel free to request other companies.
CREATE TABLE tweets_merged (
"unnamed_0" BIGINT -- Unnamed: 0,
"id" BIGINT,
"date" DOUBLE,
"text" VARCHAR,
"likes" BIGINT,
"retweets" BIGINT,
"replies" BIGINT,
"quotes" BIGINT,
"ticker" VARCHAR
);CREATE TABLE tweets_preprocessed (
"unnamed_0" BIGINT -- Unnamed: 0,
"id" BIGINT,
"date" DOUBLE,
"text" VARCHAR,
"likes" BIGINT,
"retweets" BIGINT,
"replies" BIGINT,
"quotes" BIGINT,
"ticker" VARCHAR,
"hashtag" VARCHAR
);CREATE TABLE tweets_segmented (
"unnamed_0" BIGINT -- Unnamed: 0,
"id" BIGINT,
"date" DOUBLE,
"text" VARCHAR,
"likes" BIGINT,
"retweets" BIGINT,
"replies" BIGINT,
"quotes" BIGINT,
"ticker" VARCHAR,
"hashtag" VARCHAR,
"segmentedtext" VARCHAR -- SegmentedText#,
"segmented" VARCHAR -- Segmented#
);CREATE TABLE tweets_segmented_toned (
"id" BIGINT,
"date" BIGINT,
"text" VARCHAR,
"likes" BIGINT,
"retweets" BIGINT,
"replies" BIGINT,
"quotes" BIGINT,
"ticker" VARCHAR,
"hashtag" VARCHAR,
"segmentedtext" VARCHAR -- SegmentedText#,
"segmented" VARCHAR -- Segmented#,
"tone" VARCHAR
);CREATE TABLE tweets_segmented_toned_priced (
"id" BIGINT,
"date" BIGINT,
"text" VARCHAR,
"likes" BIGINT,
"retweets" BIGINT,
"replies" BIGINT,
"quotes" BIGINT,
"ticker" VARCHAR,
"hashtag" VARCHAR,
"segmentedtext" VARCHAR -- SegmentedText#,
"segmented" VARCHAR -- Segmented#,
"tone" VARCHAR,
"price" DOUBLE
);Anyone who has the link will be able to view this.