Reddit IRL - Relatable Humour Dataset
Data from the humour subreddits /r/meirl and /r/me_irl, up to Apr 1 2022.
@kaggle.pavellexyr_the_reddit_irl_dataset
Data from the humour subreddits /r/meirl and /r/me_irl, up to Apr 1 2022.
@kaggle.pavellexyr_the_reddit_irl_dataset
Humour is always a fickle beast - internet humour, doubly so. From nonsensical surreal memes to ruthless self-deprecation, the ways of making people laugh are spontaneous and unpredictable. This isn't our first run-in with Reddit's unique sense of humour, and yet to complete the picture, we would like to present you with a comprehensive dataset of the subreddits /r/meirl and /r/me_irl, monitored all the way until April 1st, 2022.
This dataset contains all the posts and comments on the subreddits /r/meirl and /r/me_irl, from creation date all the way to 2022-04-01.
The dataset was procured using SocialGrep.
To preserve users' anonymity and to prevent targeted harassment, the data does not include usernames.
We would like to thank Dominik Vanyi for generously providing us with the cover image for this dataset.
What makes a post funny? Self-deprecating humour treads a fine line - get it right and you get thousands of upvotes, make a faux pas - and you're down to zero. We invite you to explore this topic with our new dataset.
CREATE TABLE the_reddit_irl_dataset_comments (
"type" VARCHAR,
"id" VARCHAR,
"subreddit_id" VARCHAR,
"subreddit_name" VARCHAR,
"subreddit_nsfw" BOOLEAN,
"created_utc" BIGINT,
"permalink" VARCHAR,
"body" VARCHAR,
"sentiment" DOUBLE,
"score" BIGINT
);CREATE TABLE the_reddit_irl_dataset_posts (
"type" VARCHAR,
"id" VARCHAR,
"subreddit_id" VARCHAR,
"subreddit_name" VARCHAR,
"subreddit_nsfw" BOOLEAN,
"created_utc" BIGINT,
"permalink" VARCHAR,
"domain" VARCHAR,
"url" VARCHAR,
"selftext" VARCHAR,
"title" VARCHAR,
"score" BIGINT
);Anyone who has the link will be able to view this.