Over 400K reddit submissions across 4 most visited subreddits
Dataset Description
This dataset consists of over 400K Reddit posts scraped over 4 subreddits: r/technology, r/worldnews, r/entertainment and r/sports. The data has NOT been cleaned for duplicates, advertisements and deleted posts. The data has been collected by using Pushshift API. The purpose of this dataset is to perform a NER trend analysis and sentiment analysis of most sensitive topics on r/worldnews.
I will upload the revised dataset soon.
Related Datasets
-
Reddit Data Huge
@kaggle