COVID-19 All Vaccines Tweets
Tweets about all COVID-19 Vaccines
@kaggle.gpreda_all_covid19_vaccines_tweets
I collect recent tweets about the COVID-19 vaccines used on a large scale around the world, as follows:
The data is collected using the tweepy Python package to access the Twitter API. For each vaccine, I use a relevant search term (the one most frequently used on Twitter to refer to that vaccine).
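The collection step described above can be sketched roughly as follows. This is a minimal illustration, not the exact script used for the dataset: the search terms in `SEARCH_TERMS` are hypothetical placeholders, `api` is assumed to be an authenticated `tweepy.API` object (tweepy 4.x names the search method `search_tweets`), and the mapping from tweet attributes to the dataset columns is an assumption based on the standard Twitter API v1.1 tweet object.

```python
from typing import Any, Dict, List

# Hypothetical search terms; the actual terms used for the dataset are not listed here.
SEARCH_TERMS: Dict[str, str] = {
    "Pfizer/BioNTech": "PfizerBioNTech",
    "Moderna": "Moderna",
    "Sputnik V": "SputnikV",
}


def tweet_to_row(status: Any) -> Dict[str, Any]:
    """Map one tweet object to the column names of vaccination_all_tweets."""
    user = status.user
    # Kept as a Python list here; the dataset stores this column as text.
    hashtags = [h["text"] for h in status.entities.get("hashtags", [])]
    return {
        "id": status.id,
        "user_name": user.name,
        "user_location": user.location,
        "user_description": user.description,
        "user_created": user.created_at,
        "user_followers": user.followers_count,
        "user_friends": user.friends_count,
        "user_favourites": user.favourites_count,
        "user_verified": user.verified,
        "date": status.created_at,
        "text": status.text,
        "hashtags": hashtags or None,
        "source": status.source,
        "retweets": status.retweet_count,
        "favorites": status.favorite_count,
        # Assumption: a tweet counts as a retweet if it carries retweeted_status.
        "is_retweet": hasattr(status, "retweeted_status"),
    }


def collect(api: Any, terms: Dict[str, str], per_term: int = 100) -> List[Dict[str, Any]]:
    """Run one search per vaccine term and return the results as flat rows."""
    rows: List[Dict[str, Any]] = []
    for _vaccine, query in terms.items():
        for status in api.search_tweets(q=query, count=per_term):
            rows.append(tweet_to_row(status))
    return rows
```

With valid credentials, `collect(api, SEARCH_TERMS)` would return one dict per tweet, ready to be written to a CSV file or a table. Older tweepy releases expose the same search call as `api.search`.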
The initial data was merged from tweets about the Pfizer/BioNTech vaccine. I then added tweets about the Sinopharm, Sinovac (both Chinese-produced vaccines), Moderna, Oxford/AstraZeneca, Covaxin and Sputnik V vaccines. In the first days, collection ran twice a day until I had identified the approximate quota of new tweets; collection (for all vaccines) then stabilized at once a day, during morning hours (GMT).
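Because collection runs repeatedly and covers several vaccines, successive batches can contain the same tweet more than once. One way to merge batches is to key rows on the tweet `id` and keep a single copy. The sketch below assumes each batch is a list of dicts with an `id` key; the function name `merge_batches` and the sample rows are made up for illustration, and this is not necessarily how the published file was built.

```python
from typing import Any, Dict, Iterable, List


def merge_batches(*batches: Iterable[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Merge tweet batches, keeping the first row seen for each tweet id."""
    merged: Dict[int, Dict[str, Any]] = {}
    for batch in batches:
        for row in batch:
            # setdefault leaves an existing entry untouched, so duplicates are skipped.
            merged.setdefault(row["id"], row)
    # Dicts preserve insertion order, so rows come back in first-seen order.
    return list(merged.values())


# Made-up placeholder rows, not taken from the dataset.
pfizer_batch = [{"id": 1, "text": "placeholder A"}, {"id": 2, "text": "placeholder B"}]
moderna_batch = [{"id": 2, "text": "placeholder B"}, {"id": 3, "text": "placeholder C"}]

all_rows = merge_batches(pfizer_batch, moderna_batch)
print(len(all_rows))
```

Keeping the first occurrence means later collections never overwrite counts such as `retweets` or `favorites`; keeping the most recent copy instead would be an equally reasonable choice.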
You can perform multiple operations on the vaccine tweets. Here are a few possible suggestions:
CREATE TABLE vaccination_all_tweets (
"id" BIGINT,
"user_name" VARCHAR,
"user_location" VARCHAR,
"user_description" VARCHAR,
"user_created" TIMESTAMP,
"user_followers" BIGINT,
"user_friends" BIGINT,
"user_favourites" BIGINT,
"user_verified" BOOLEAN,
"date" TIMESTAMP,
"text" VARCHAR,
"hashtags" VARCHAR,
"source" VARCHAR,
"retweets" BIGINT,
"favorites" BIGINT,
"is_retweet" BOOLEAN
);
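As one example of working with this schema, the sketch below loads the table into an in-memory SQLite database and counts original (non-retweet) tweets per day. SQLite is an assumption chosen only so that the example is self-contained; its type affinity rules accept the `VARCHAR`, `BIGINT`, `TIMESTAMP` and `BOOLEAN` declarations as written. The inserted rows are made-up placeholders, not data from the dataset.

```python
import sqlite3

SCHEMA = """
CREATE TABLE vaccination_all_tweets (
    "id" BIGINT,
    "user_name" VARCHAR,
    "user_location" VARCHAR,
    "user_description" VARCHAR,
    "user_created" TIMESTAMP,
    "user_followers" BIGINT,
    "user_friends" BIGINT,
    "user_favourites" BIGINT,
    "user_verified" BOOLEAN,
    "date" TIMESTAMP,
    "text" VARCHAR,
    "hashtags" VARCHAR,
    "source" VARCHAR,
    "retweets" BIGINT,
    "favorites" BIGINT,
    "is_retweet" BOOLEAN
);
"""

conn = sqlite3.connect(":memory:")
conn.execute(SCHEMA)

# Placeholder rows; only the columns needed for the query are filled in.
conn.executemany(
    'INSERT INTO vaccination_all_tweets ("id", "date", "text", "is_retweet") '
    "VALUES (?, ?, ?, ?)",
    [
        (1, "2021-03-01 08:15:00", "placeholder A", 0),
        (2, "2021-03-01 09:00:00", "placeholder B", 1),
        (3, "2021-03-02 07:30:00", "placeholder C", 0),
        (4, "2021-03-02 07:45:00", "placeholder D", 0),
    ],
)

# Count non-retweet tweets per calendar day.
QUERY = """
SELECT date("date") AS day, COUNT(*) AS n_tweets
FROM vaccination_all_tweets
WHERE "is_retweet" = 0
GROUP BY day
ORDER BY day
"""
daily_counts = conn.execute(QUERY).fetchall()
for day, n_tweets in daily_counts:
    print(day, n_tweets)
```

The same query pattern extends to other columns, for instance grouping by `"source"` or filtering on `"user_verified"`.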