Baselight
Sign In
kaggle

The Reddit Dataset Dataset

Kaggle
•

@kaggle.pavellexyr_the_reddit_dataset_dataset

Loading...
Loading...

A meta dataset of Reddit's own /r/datasets community.

Dataset Description

Context

Datasets... In a way, the Kaggle community is built around them. You can't analyze data without having it. Here, we aim to create a meta-corpus of datasets posted to Reddit. A dataset dataset, if you will.

Content

The following dataset is the comprehensive corpus of all the posts and comments made on Reddit's /r/datasets board, from its inception all the way to the first of March, 2022.

The dataset was procured using SocialGrep.

To preserve users' anonymity and to prevent targeted harassment, the data does not include usernames.

Acknowledgements

We would like to thank Chris Liverani for generously providing the cover image for this dataset.

Inspiration

Datasets are nice - we like our data.


Related Datasets

Share link

Anyone who has the link will be able to view this.