Cleaning dataset for Kaggle Competition "Real or Not? NLP with Disaster Tweets"
Dataset Description
Context
This dataset is a cleaned version of the data from the Getting Started Prediction Competition "Real or Not? NLP with Disaster Tweets". It was produced by the public notebook "NLP with Disaster Tweets - EDA and Cleaning data".
In the future, I plan to improve the cleaning and update the dataset.
Content
id - a unique identifier for each tweet
text - the text of the tweet
location - the location the tweet was sent from (may be blank)
keyword - a particular keyword from the tweet (may be blank)
target - in train.csv only, this denotes whether a tweet is about a real disaster (1) or not (0)
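The field list above can be sketched as a small loader. This is only an illustration, assuming the CSV header names match the field names listed here. The sample rows are made up and are not taken from the dataset, and `load_tweets` is a hypothetical helper name.

```python
import csv
import io


def load_tweets(handle):
    """Read tweet rows from an open CSV handle into a list of dicts."""
    rows = []
    for row in csv.DictReader(handle):
        target = row.get("target")
        rows.append({
            "id": row["id"],
            # keyword and location may be blank; normalise blanks to None
            "keyword": row.get("keyword") or None,
            "location": row.get("location") or None,
            "text": row["text"],
            # target exists in train.csv only; None when the column is absent or empty
            "target": int(target) if target not in (None, "") else None,
        })
    return rows


# Made-up sample rows, for illustration only (assumed header layout).
SAMPLE_CSV = """id,keyword,location,text,target
1,,,Example tweet about a flood warning,1
2,fire,Springfield,This mixtape is fire,0
"""

rows = load_tweets(io.StringIO(SAMPLE_CSV))
```

To read the real file, the same function could be called with `open("train.csv", newline="", encoding="utf-8")` in place of the in-memory sample.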
Acknowledgements
Thanks to the Kaggle team for the competition "Real or Not? NLP with Disaster Tweets" and its datasets. The original dataset was created by the company figure-eight and first shared on their 'Data For Everyone' website. Tweet source: https://twitter.com/AnyOtherAnnaK/status/629195955506708480
Thanks to the website "Ambulance services drive, strive to keep you alive" for its image, which closely resembles the image of the competition "Real or Not? NLP with Disaster Tweets" and which I used as the image for this dataset.
Inspiration
You are predicting whether a given tweet is about a real disaster or not. If so, predict a 1. If not, predict a 0.
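The prediction contract (1 for a real disaster, 0 otherwise) can be illustrated with a deliberately naive rule. This is a toy sketch, not a recommended model. The cue words and the `predict` function are hypothetical and were chosen only to show the expected output format.

```python
# Hypothetical cue words chosen for illustration; not derived from the dataset.
DISASTER_CUES = {"earthquake", "flood", "wildfire", "evacuation"}


def predict(text):
    """Return 1 if the tweet contains a cue word, else 0."""
    # Split on whitespace, strip common punctuation, and lowercase each token.
    words = {w.strip(".,!?#@:;\"'").lower() for w in text.split()}
    return 1 if words & DISASTER_CUES else 0


predictions = [predict(t) for t in [
    "Evacuation ordered after the flood",
    "That concert was a total disaster lol",
]]
```

The second example shows why a word-matching rule falls short: the word "disaster" is often used figuratively, so a real solution would need a learned text classifier.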