A meta dataset of Reddit's own /r/datasets community.
Dataset Description
Context
Datasets... In a way, the Kaggle community is built around them. You can't analyze data without having it. Here, we aim to create a meta-corpus of datasets posted to Reddit. A dataset dataset, if you will.
Content
This dataset is a comprehensive corpus of all posts and comments made on Reddit's /r/datasets board, from its inception through March 1, 2022.
The dataset was procured using SocialGrep.
To preserve users' anonymity and to prevent targeted harassment, the data does not include usernames.
Acknowledgements
We would like to thank Chris Liverani for generously providing the cover image for this dataset.
Inspiration
Datasets are nice - we like our data.
Related Datasets
- The Reddit /r/Place Dataset (@kaggle)
- Eucalyptus Growth And Environmental Data (@euremarkable)