Baselight

Reddit Data Huge

Dataset containing Reddit Posts and Comments from various different subreddits.

@kaggle.prakharrathi25_reddit_data_huge

Loading...
Loading...

About this Dataset

Reddit Data Huge

What is Reddit?

Reddit is a collection of forums where people can share news and content as a thread or comment on other people’s posts. Reddit is broken up into more than a million communities known as “subreddits,” each of which covers a different topic. The name of a subreddit begins with /r/, which is part of the URLs that Reddit uses. For example, /r/nba is a subreddit where people talk about the National Basketball Association, while /r/boardgames is a subreddit for people to discuss board games.

Content

In this dataset, I have added data from many different subreddits. This will act as an NLP gold mine for Social Media Analysis. This can help people understand what the youth is talking about.

Acknowledgements

I have collected it on my own using this article.

Tables

Adviceforteens Data

@kaggle.prakharrathi25_reddit_data_huge.adviceforteens_data
  • 943.46 KB
  • 818 rows
  • 12 columns
Loading...

CREATE TABLE adviceforteens_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Anxiety Reddit Data

@kaggle.prakharrathi25_reddit_data_huge.anxiety_reddit_data
  • 537.37 KB
  • 1000 rows
  • 12 columns
Loading...

CREATE TABLE anxiety_reddit_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Applyingtocollege Data

@kaggle.prakharrathi25_reddit_data_huge.applyingtocollege_data
  • 788.64 KB
  • 995 rows
  • 12 columns
Loading...

CREATE TABLE applyingtocollege_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Astrology Data

@kaggle.prakharrathi25_reddit_data_huge.astrology_data
  • 539.42 KB
  • 983 rows
  • 12 columns
Loading...

CREATE TABLE astrology_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Business Data

@kaggle.prakharrathi25_reddit_data_huge.business_data
  • 275.67 KB
  • 989 rows
  • 12 columns
Loading...

CREATE TABLE business_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Careerguidance Data

@kaggle.prakharrathi25_reddit_data_huge.careerguidance_data
  • 1.03 MB
  • 1000 rows
  • 12 columns
Loading...

CREATE TABLE careerguidance_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

College

@kaggle.prakharrathi25_reddit_data_huge.college
  • 637.48 KB
  • 884 rows
  • 11 columns
Loading...

CREATE TABLE college (
  "id" VARCHAR,
  "title" VARCHAR,
  "body" VARCHAR,
  "subreddit" VARCHAR,
  "upvotes" BIGINT,
  "url" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

College Data

@kaggle.prakharrathi25_reddit_data_huge.college_data
  • 654.53 KB
  • 885 rows
  • 12 columns
Loading...

CREATE TABLE college_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Colombia Reddit Data

@kaggle.prakharrathi25_reddit_data_huge.colombia_reddit_data
  • 219.73 KB
  • 1000 rows
  • 12 columns
Loading...

CREATE TABLE colombia_reddit_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Computer Science Data

@kaggle.prakharrathi25_reddit_data_huge.computer_science_data
  • 505.11 KB
  • 984 rows
  • 12 columns
Loading...

CREATE TABLE computer_science_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Covid19 Data

@kaggle.prakharrathi25_reddit_data_huge.covid19_data
  • 316.26 KB
  • 992 rows
  • 12 columns
Loading...

CREATE TABLE covid19_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Dating Data

@kaggle.prakharrathi25_reddit_data_huge.dating_data
  • 1.05 MB
  • 993 rows
  • 12 columns
Loading...

CREATE TABLE dating_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Depression Reddit Data

@kaggle.prakharrathi25_reddit_data_huge.depression_reddit_data
  • 827.16 KB
  • 1000 rows
  • 12 columns
Loading...

CREATE TABLE depression_reddit_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Employment Reddit Data

@kaggle.prakharrathi25_reddit_data_huge.employment_reddit_data
  • 2.15 MB
  • 2495 rows
  • 12 columns
Loading...

CREATE TABLE employment_reddit_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Engineering Data

@kaggle.prakharrathi25_reddit_data_huge.engineering_data
  • 300.97 KB
  • 994 rows
  • 12 columns
Loading...

CREATE TABLE engineering_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Entrepreneur Data

@kaggle.prakharrathi25_reddit_data_huge.entrepreneur_data
  • 2.79 MB
  • 999 rows
  • 12 columns
Loading...

CREATE TABLE entrepreneur_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Feminism Data

@kaggle.prakharrathi25_reddit_data_huge.feminism_data
  • 235.59 KB
  • 1000 rows
  • 12 columns
Loading...

CREATE TABLE feminism_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Gradschool Data

@kaggle.prakharrathi25_reddit_data_huge.gradschool_data
  • 616.01 KB
  • 769 rows
  • 12 columns
Loading...

CREATE TABLE gradschool_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

High School

@kaggle.prakharrathi25_reddit_data_huge.high_school
  • 440.2 KB
  • 998 rows
  • 11 columns
Loading...

CREATE TABLE high_school (
  "id" VARCHAR,
  "title" VARCHAR,
  "body" VARCHAR,
  "subreddit" VARCHAR,
  "upvotes" BIGINT,
  "url" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Kidsrights Data

@kaggle.prakharrathi25_reddit_data_huge.kidsrights_data
  • 116.13 KB
  • 471 rows
  • 12 columns
Loading...

CREATE TABLE kidsrights_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Merged Reddit Data

@kaggle.prakharrathi25_reddit_data_huge.merged_reddit_data
  • 3.17 MB
  • 8401 rows
  • 8 columns
Loading...

CREATE TABLE merged_reddit_data (
  "unnamed_0" VARCHAR,
  "id" VARCHAR,
  "flair" VARCHAR,
  "subreddit" VARCHAR,
  "text" VARCHAR,
  "sentiment" VARCHAR,
  "creation_date" TIMESTAMP,
  "upvotes" BIGINT
);

Mexico Spanishlanguage Reddit Data

@kaggle.prakharrathi25_reddit_data_huge.mexico_spanishlanguage_reddit_data
  • 197.54 KB
  • 998 rows
  • 12 columns
Loading...

CREATE TABLE mexico_spanishlanguage_reddit_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Neutralpolitics Data

@kaggle.prakharrathi25_reddit_data_huge.neutralpolitics_data
  • 1.2 MB
  • 1000 rows
  • 12 columns
Loading...

CREATE TABLE neutralpolitics_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Povertyfinance Data

@kaggle.prakharrathi25_reddit_data_huge.povertyfinance_data
  • 653.51 KB
  • 990 rows
  • 12 columns
Loading...

CREATE TABLE povertyfinance_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Science Data

@kaggle.prakharrathi25_reddit_data_huge.science_data
  • 373.15 KB
  • 991 rows
  • 12 columns
Loading...

CREATE TABLE science_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "is_original" BOOLEAN,
  "flair" VARCHAR,
  "num_comments" BIGINT,
  "title" VARCHAR,
  "subreddit" VARCHAR,
  "body" VARCHAR,
  "url" VARCHAR,
  "upvotes" BIGINT,
  "comments" VARCHAR,
  "creation_date" TIMESTAMP
);

Share link

Anyone who has the link will be able to view this.