Baselight

Top 100 Reddit Posts (daily Update)

A Dataset of Top 100 Reddit Posts for Daily Updates: Context, Creation, and Ins

@kaggle.prathamsaraf1389_top_100_reddit_posts_daily_update

About this Dataset

The "Top 100 Reddit Posts (daily update)" dataset is created by a Python script that scrapes the top 10 posts from each of the following subreddits: 'AskReddit', 'worldnews', 'science', 'technology', 'politics', 'movies', 'sports', 'gaming', 'books', and 'explainlikeimfive'.
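The original scraping script is not shown here, so the following is only a minimal sketch of the collection loop the description implies: ten subreddits, ten posts each, one hundred rows per run. The `collect_top_posts` function and the `fetch_top` callable are hypothetical names; `fetch_top` stands in for whatever Reddit client the real script uses, and `fake_fetch` exists only so the sketch runs without network access.

```python
from typing import Callable, Iterable

# Subreddits named in the dataset description.
SUBREDDITS = [
    "AskReddit", "worldnews", "science", "technology", "politics",
    "movies", "sports", "gaming", "books", "explainlikeimfive",
]


def collect_top_posts(
    fetch_top: Callable[[str, int], Iterable[dict]],
    per_subreddit: int = 10,
) -> list:
    """Gather `per_subreddit` posts from each subreddit into one flat list.

    `fetch_top` is a hypothetical stand-in for a real Reddit client call;
    it takes a subreddit name and a limit and yields one dict per post.
    """
    rows = []
    for name in SUBREDDITS:
        for post in fetch_top(name, per_subreddit):
            row = dict(post)          # copy so the caller's data is untouched
            row["subreddit"] = name   # record which subreddit the post came from
            rows.append(row)
    return rows


def fake_fetch(name: str, limit: int) -> list:
    """Made-up fetcher used only to exercise the loop offline."""
    return [{"post_id": f"{name}-{i}", "title": f"post {i}"} for i in range(limit)]


rows = collect_top_posts(fake_fetch)
print(len(rows))  # 10 subreddits x 10 posts each
```

Passing the fetcher in as an argument keeps the loop independent of any particular API client, which also makes it easy to test.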

The dataset contains the following columns:

  • Title
  • Author
  • Subreddit
  • Score
  • Permalink
  • Creation Time
  • Number of Comments
  • Upvote Ratio
  • URL
  • Post ID
  • Is Original Content
  • Flair
  • Comments

The dataset was created to gather data from popular Reddit posts for data analysis and machine learning tasks such as sentiment analysis, topic modeling, or predicting post popularity from features like title, author, and subreddit.

Tables

Top Reddit Posts

@kaggle.prathamsaraf1389_top_100_reddit_posts_daily_update.top_reddit_posts
  • 482.14 MB
  • 13988 rows
  • 14 columns

CREATE TABLE top_reddit_posts (
  "unnamed_0" BIGINT,
  "title" VARCHAR,
  "author" VARCHAR,
  "subreddit" VARCHAR,
  "score" BIGINT,
  "permalink" VARCHAR,
  "creation_time" DOUBLE,
  "number_of_comments" BIGINT,
  "upvote_ratio" DOUBLE,
  "url" VARCHAR,
  "post_id" VARCHAR,
  "is_original_content" BOOLEAN,
  "flair" VARCHAR,
  "comments" VARCHAR
);
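
Two properties of this schema are worth handling before analysis, and the sketch below illustrates both under stated assumptions. First, `creation_time` is stored as a DOUBLE; the code assumes it holds Unix epoch seconds in UTC, which the page does not confirm. Second, because the dataset is updated daily, the same `post_id` may appear in more than one snapshot; this is also an assumption. The helper names `to_utc` and `latest_snapshot` and the sample rows are made up for illustration.

```python
from datetime import datetime, timezone


def to_utc(creation_time: float) -> datetime:
    """Convert a creation_time value to an aware UTC datetime.

    Assumes the value is Unix epoch seconds (not confirmed by the source).
    """
    return datetime.fromtimestamp(creation_time, tz=timezone.utc)


def latest_snapshot(rows: list) -> list:
    """Keep one row per post_id, preferring the row that appears last.

    Assumes rows are ordered oldest snapshot first, so later rows carry
    the most recent score and comment counts.
    """
    latest = {}
    for row in rows:
        latest[row["post_id"]] = row  # later rows overwrite earlier ones
    return list(latest.values())


# Made-up sample rows using column names from the schema above.
sample = [
    {"post_id": "abc", "score": 100, "creation_time": 1700000000.0},
    {"post_id": "abc", "score": 250, "creation_time": 1700000000.0},
    {"post_id": "xyz", "score": 40, "creation_time": 1700086400.0},
]

unique = latest_snapshot(sample)
for row in unique:
    print(row["post_id"], row["score"], to_utc(row["creation_time"]).isoformat())
```

If the real data turns out to contain no repeated `post_id` values, the deduplication step is a no-op and can be dropped.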
