One million questions from /r/AskReddit, going back from September 2021.
Dataset Description
Context
Ah, questions. One of the most important parts of natural dialogue. Automated question answering has been a long-standing problem in the NLP field. To help solve it, we present you with this dataset.
Content
The following dataset comprises one million questions from /r/AskReddit, procured using SocialGrep.
The questions are labelled with date of creation and their score.
Acknowledgements
We would like to thank Etienne Girardet for generously providing us with a background image for this dataset.
Inspiration
- What makes a popular Reddit question?
- What makes a good Reddit question?
- Can Reddit teach us more about how to ask questions properly?
Related Datasets
-
Ten Million Reddit Answers
@kaggle
-
Wars On Territory
@owid