A Dataset of Top 100 Reddit Posts for Daily Updates: Context, Creation, and Ins
Dataset Description
The dataset titled "Top 100 Reddit Posts (daily update)" is created using a Python script that scrapes data from the top 10 posts in each of the following subreddits: 'AskReddit', 'worldnews', 'science', 'technology', 'politics', 'movies', 'sports', 'gaming', 'books', and 'explainlikeimfive'
The dataset contains the following columns:
- Title
- Author
- Subreddit
- Score
- Permalink
- Creation Time
- Number of Comments
- Upvote Ratio
- URL
- Post ID
- Is Original Content
- Flair
- Comments
The inspiration behind creating this dataset is to gather data from popular Reddit posts, which can be used for various data analysis and machine learning tasks, such as sentiment analysis, topic modeling, or predicting post popularity based on features like title, author, and subreddit.
Related Datasets
-
Reddit Data Huge
@kaggle