Reddit: /r/gardening
Analyzing Posts from the Gardening Subreddit
By Reddit [source]
About this dataset
This rich dataset captures the gardening activity of Reddit users in the Gardening Subreddit. By examining this data, we can gain valuable insight into how gardeners from around the world are discussing and carrying out their various projects. The dataset is composed of posts that include a title, score (which indicates the community response), URL, comments number, created date and time, body content containing details about gardening projects or advice shared by members of the community, and post timestamp. Through careful analysis of this data set, researchers can identify trends in popular gardening topics as well as pinpoint what issues are being discussed at any given time by members of this virtual gardeners' hub
More Datasets
For more datasets, click here.
Featured Notebooks
- 🚨 Your notebook can be here! 🚨!
How to use the dataset
This dataset provides unique insights into the gardening community on Reddit and can be used to explore gardening trends. In this dataset, we have posts from the Reddit gardening subreddit with features such as title, score, comment_num, created date, body, and timestamp. With these data points we are able to tell stories about what people are talking about in the world of gardening.
You can use this dataset to answer questions such as:
- Which topics were being discussed most often?
- What were some of the most popular projects among gardeners?
- Are there any common trends or patterns evident in posts?
- Are people more engaged/interested in certain types of content or topics than others?
Additionally, you could compare different aspects over time by creating visualizations or using Plotly. For example you could use plotly to create a line graph showing how post scores have changed over time or which tags received higher scores compared to others during certain time frames. You can also do comparative analyses between indicators such as user activity levels vs reposts/score etc., and examine correlations between user activity levels and post engagement/interaction etc.
Finally you can analyse correlations between titles and body text for interesting topics/trends that would be difficult for machines to identify without human input. By filtering out repetitive posts (or low engagement ones) you may uncover relationships that would otherwise remain hidden!
Research Ideas
- Analyzing gardening trends over time by comparing the most popular posts from different time periods.
- Examining the relationship between total number of comments and score to measure post popularity and engagement on the Gardening Subreddit.
- Identifying keywords in post titles to categorize topics discussed within the Reddit Gardening community, such as organic gardening or landscaping techniques
Acknowledgements
If you use this dataset in your research, please credit the original authors.
Data Source
License
License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.
Columns
File: gardening.csv
Column name |
Description |
title |
The title of the post. (String) |
score |
The number of upvotes the post has received. (Integer) |
url |
The URL of the post. (String) |
comms_num |
The number of comments the post has received. (Integer) |
created |
The date the post was created. (Date) |
body |
The body of the post. (String) |
timestamp |
The timestamp of the post. (Integer) |
Acknowledgements
If you use this dataset in your research, please credit the original authors.
If you use this dataset in your research, please credit Reddit.