Baselight

Reddit Data Huge

Dataset containing Reddit posts and comments from a range of subreddits.

@kaggle.prakharrathi25_reddit_data_huge

About this Dataset

What is Reddit?

Reddit is a collection of forums where people can share news and other content in threads and comment on other people’s posts. Reddit is divided into more than a million communities known as “subreddits,” each of which covers a different topic. Subreddit names are written with the prefix /r/, which also appears in the subreddit's URL. For example, /r/nba is a subreddit where people talk about the National Basketball Association, while /r/boardgames is a subreddit for discussing board games.
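Because the /r/ prefix is part of the URL, a subreddit name can be recovered from a post link. The following is a minimal sketch; the function name `subreddit_from_url` and the character set accepted for subreddit names are assumptions made for illustration, not something this dataset specifies.

```python
import re
from typing import Optional

# Matches "/r/<name>" anywhere in a path or URL. The allowed characters
# (letters, digits, underscores) are an assumption made for this sketch.
_SUBREDDIT_RE = re.compile(r"/r/([A-Za-z0-9_]+)")


def subreddit_from_url(url: str) -> Optional[str]:
    """Return the subreddit name found in a URL or path, or None if absent."""
    match = _SUBREDDIT_RE.search(url)
    return match.group(1) if match else None


print(subreddit_from_url("https://www.reddit.com/r/nba/"))        # nba
print(subreddit_from_url("/r/boardgames"))                        # boardgames
print(subreddit_from_url("https://www.reddit.com/user/example"))  # None
```

The same helper could be applied to the "url" column of the tables below, provided that column holds Reddit links rather than external ones.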

Content

This dataset collects posts and comments from several subreddits, listed in the tables below. It is intended as a resource for NLP and social media analysis, for example to study what young people are talking about.

Acknowledgements

I collected the data myself, following this article.

Tables

Student Affairs

@kaggle.prakharrathi25_reddit_data_huge.student_affairs
  • 574.56 KB
  • 986 rows
  • 11 columns

CREATE TABLE student_affairs (
  "id" VARCHAR,
  "title" VARCHAR,
  "body" VARCHAR,
  "subreddit" VARCHAR,
  "upvotes" BIGINT,
  "url" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);
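To show how this schema can be queried, here is a minimal sketch that loads the table definition into an in-memory SQLite database and ranks posts by upvotes. The sample rows, including the subreddit value, and the helper name `top_posts` are invented for illustration and are not taken from the dataset.

```python
import sqlite3

# Column names and types copied from the table definition above.
# SQLite accepts these type names as written.
SCHEMA = """
CREATE TABLE student_affairs (
  "id" VARCHAR, "title" VARCHAR, "body" VARCHAR, "subreddit" VARCHAR,
  "upvotes" BIGINT, "url" VARCHAR, "is_original" BOOLEAN, "flair" VARCHAR,
  "num_comments" BIGINT, "comments" VARCHAR, "creation_date" TIMESTAMP
);
"""

# Hypothetical rows, invented purely for illustration (not from the dataset).
SAMPLE_ROWS = [
    ("a1", "First post", "text", "studentaffairs", 12,
     "https://example.com/a1", 1, None, 3, "c1", "2020-01-01 10:00:00"),
    ("a2", "Second post", "text", "studentaffairs", 40,
     "https://example.com/a2", 0, None, 9, "c2", "2020-01-02 11:00:00"),
    ("a3", "Third post", "text", "studentaffairs", 7,
     "https://example.com/a3", 1, None, 1, "c3", "2020-01-03 12:00:00"),
]


def top_posts(conn: sqlite3.Connection, limit: int) -> list:
    """Return (id, title, upvotes) for the most upvoted posts."""
    cursor = conn.execute(
        'SELECT "id", "title", "upvotes" FROM student_affairs '
        'ORDER BY "upvotes" DESC, "id" ASC LIMIT ?',
        (limit,),
    )
    return cursor.fetchall()


conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)
conn.executemany(
    "INSERT INTO student_affairs VALUES (?,?,?,?,?,?,?,?,?,?,?)", SAMPLE_ROWS
)
print(top_posts(conn, 2))  # [('a2', 'Second post', 40), ('a1', 'First post', 12)]
```

The SELECT statement itself should carry over to other SQL engines with little change, since it uses only standard ordering and limiting.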

Suicidewatch Data

@kaggle.prakharrathi25_reddit_data_huge.suicidewatch_data
  • 810.51 KB
  • 986 rows
  • 12 columns

CREATE TABLE suicidewatch_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Teenagers

@kaggle.prakharrathi25_reddit_data_huge.teenagers
  • 211.64 KB
  • 957 rows
  • 12 columns

CREATE TABLE teenagers (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Whatsbotheringyou Reddit Data

@kaggle.prakharrathi25_reddit_data_huge.whatsbotheringyou_reddit_data
  • 1.26 MB
  • 996 rows
  • 12 columns

CREATE TABLE whatsbotheringyou_reddit_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Youthandgov Data

@kaggle.prakharrathi25_reddit_data_huge.youthandgov_data
  • 27.65 KB
  • 46 rows
  • 12 columns

CREATE TABLE youthandgov_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Youthknows Data

@kaggle.prakharrathi25_reddit_data_huge.youthknows_data
  • 28.04 KB
  • 46 rows
  • 12 columns

CREATE TABLE youthknows_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Youthrights Data

@kaggle.prakharrathi25_reddit_data_huge.youthrights_data
  • 332.34 KB
  • 934 rows
  • 12 columns

CREATE TABLE youthrights_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);
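The tables above share the same eleven content columns, but student_affairs lists them in a different order and the other six carry an extra "unnamed_0" column. That column looks like a leftover row index from the export, though this is an assumption the source does not confirm. A positional `SELECT *` union would therefore misalign the data. The sketch below combines tables by selecting the shared columns by name; it uses in-memory SQLite, two of the seven tables, and hypothetical sample rows, and the helper name `union_all_sql` is illustrative.

```python
import sqlite3

# The eleven columns present in every table, in one fixed order.
SHARED_COLUMNS = [
    "id", "title", "body", "subreddit", "upvotes", "url",
    "is_original", "flair", "num_comments", "comments", "creation_date",
]


def union_all_sql(tables: list) -> str:
    """Build a UNION ALL query that selects the shared columns by name."""
    cols = ", ".join(f'"{c}"' for c in SHARED_COLUMNS)
    return "\nUNION ALL\n".join(f'SELECT {cols} FROM "{t}"' for t in tables)


conn = sqlite3.connect(":memory:")
# Two of the schemas above, with column types omitted for brevity.
conn.execute(
    'CREATE TABLE student_affairs ("id", "title", "body", "subreddit", '
    '"upvotes", "url", "is_original", "flair", "num_comments", "comments", '
    '"creation_date")'
)
conn.execute(
    'CREATE TABLE teenagers ("unnamed_0", "id", "is_original", "flair", '
    '"num_comments", "title", "subreddit", "body", "url", "upvotes", '
    '"comments", "creation_date")'
)
# Hypothetical rows, invented for illustration (not from the dataset).
conn.execute(
    "INSERT INTO student_affairs VALUES (?,?,?,?,?,?,?,?,?,?,?)",
    ("s1", "Title S", "body", "studentaffairs", 5, "u", 1, None, 0, "",
     "2020-01-01 00:00:00"),
)
conn.execute(
    "INSERT INTO teenagers VALUES (?,?,?,?,?,?,?,?,?,?,?,?)",
    (0, "t1", 1, None, 2, "Title T", "teenagers", "body", "u", 9, "",
     "2020-01-02 00:00:00"),
)

query = union_all_sql(["student_affairs", "teenagers"]) + '\nORDER BY "id"'
rows = conn.execute(query).fetchall()
# Show id, subreddit, and upvotes to confirm the columns line up.
summary = [(r[0], r[3], r[4]) for r in rows]
print(summary)  # [('s1', 'studentaffairs', 5), ('t1', 'teenagers', 9)]
```

Naming the columns explicitly keeps the combined result correct even though the per-table column order differs, and it drops "unnamed_0" without altering the source tables.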
