Baselight

Trending YouTube Video Statistics

Daily statistics for trending YouTube videos

@kaggle.datasnaek_youtube_new

About this Dataset

UPDATE: The source code used to collect this data has been released.

Context

YouTube, the world-famous video-sharing website, maintains a list of the top trending videos on the platform. According to Variety magazine, “To determine the year’s top-trending videos, YouTube uses a combination of factors including measuring users interactions (number of views, shares, comments and likes). Note that they’re not the most-viewed videos overall for the calendar year.” Top performers on the YouTube trending list are music videos (such as the famously viral “Gangnam Style”), celebrity and/or reality TV performances, and the random dude-with-a-camera viral videos that YouTube is well known for.

This dataset is a daily record of the top trending YouTube videos.

Note that this dataset is a structurally improved version of an earlier dataset.

Content

This dataset includes several months (and counting) of data on daily trending YouTube videos. Data is included for the US, GB, DE, CA, and FR regions (USA, Great Britain, Germany, Canada, and France, respectively), with up to 200 listed trending videos per day.

EDIT: Now includes data from RU, MX, KR, JP and IN regions (Russia, Mexico, South Korea, Japan and India respectively) over the same time period.

Each region’s data is in a separate file. Data includes the video title, channel title, publish time, tags, views, likes and dislikes, description, and comment count.
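The table definitions on this page store both trending_date and publish_time as VARCHAR, so the values have to be parsed before any time-based analysis. The following is a minimal sketch, not the dataset author's method. It assumes that trending_date uses a yy.dd.mm layout (for example 17.14.11) and that publish_time is an ISO 8601 UTC timestamp such as 2017-11-13T17:13:01.000Z. Neither format is stated on this page, so check a few rows first. The function names are hypothetical.

```python
from datetime import date, datetime, timezone


def parse_trending_date(value: str) -> date:
    """Parse a trending_date string, ASSUMING a yy.dd.mm layout."""
    return datetime.strptime(value, "%y.%d.%m").date()


def parse_publish_time(value: str) -> datetime:
    """Parse a publish_time string, ASSUMING ISO 8601 in UTC with a 'Z' suffix."""
    parsed = datetime.strptime(value, "%Y-%m-%dT%H:%M:%S.%fZ")
    return parsed.replace(tzinfo=timezone.utc)


def days_until_trending(trending_date: str, publish_time: str) -> int:
    """Whole calendar days between publication (UTC date) and the trending date."""
    published_on = parse_publish_time(publish_time).date()
    return (parse_trending_date(trending_date) - published_on).days


# Illustrative values only; they are not taken from this page.
print(days_until_trending("17.14.11", "2017-11-13T17:13:01.000Z"))
```

If the assumed layouts do not match the stored values, strptime raises ValueError, which makes a format mismatch easy to spot early.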

The data also includes a category_id field, which varies between regions. To retrieve the category name for a specific video, look up its category_id in the JSON file associated with its region. One such file is included for each region in the dataset.
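The category lookup can be sketched as follows. This is an illustration under assumptions: the JSON layout (an "items" list whose entries carry an "id" string and a "snippet.title") follows the general shape of a YouTube Data API category listing and is not confirmed by this page. The sample document and the function name are hypothetical.

```python
import json

# Hypothetical excerpt of one region's category file. The real file's layout
# is ASSUMED to resemble a YouTube Data API category listing.
SAMPLE_CATEGORY_JSON = """
{
  "items": [
    {"id": "10", "snippet": {"title": "Music"}},
    {"id": "24", "snippet": {"title": "Entertainment"}}
  ]
}
"""


def build_category_lookup(raw_json: str) -> dict[int, str]:
    """Map integer category_id values to category titles."""
    payload = json.loads(raw_json)
    # Ids are stored as strings in the assumed layout, while the tables
    # declare category_id as BIGINT, so convert them to int here.
    return {int(item["id"]): item["snippet"]["title"] for item in payload["items"]}


lookup = build_category_lookup(SAMPLE_CATEGORY_JSON)
print(lookup.get(10, "Unknown"))
```

Because category_id varies between regions, build one lookup per region file rather than sharing a single mapping across all tables.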

For more information on specific columns in the dataset, refer to the column metadata.

Acknowledgements

This dataset was collected using the YouTube API.

Inspiration

Possible uses for this dataset include:

  • Sentiment analysis in a variety of forms.
  • Categorising YouTube videos based on their comments and statistics.
  • Training ML algorithms like RNNs to generate their own YouTube comments.
  • Analysing what factors affect how popular a YouTube video will be.
  • Statistical analysis over time.

For further inspiration, see the kernels on this dataset!

Tables

Cavideos

@kaggle.datasnaek_youtube_new.cavideos
  • 33.4 MB
  • 40881 rows
  • 16 columns

CREATE TABLE cavideos (
  "video_id" VARCHAR,
  "trending_date" VARCHAR,
  "title" VARCHAR,
  "channel_title" VARCHAR,
  "category_id" BIGINT,
  "publish_time" VARCHAR,
  "tags" VARCHAR,
  "views" BIGINT,
  "likes" BIGINT,
  "dislikes" BIGINT,
  "comment_count" BIGINT,
  "thumbnail_link" VARCHAR,
  "comments_disabled" BOOLEAN,
  "ratings_disabled" BOOLEAN,
  "video_error_or_removed" BOOLEAN,
  "description" VARCHAR
);

Devideos

@kaggle.datasnaek_youtube_new.devideos
  • 34.39 MB
  • 40840 rows
  • 16 columns

CREATE TABLE devideos (
  "video_id" VARCHAR,
  "trending_date" VARCHAR,
  "title" VARCHAR,
  "channel_title" VARCHAR,
  "category_id" BIGINT,
  "publish_time" VARCHAR,
  "tags" VARCHAR,
  "views" BIGINT,
  "likes" BIGINT,
  "dislikes" BIGINT,
  "comment_count" BIGINT,
  "thumbnail_link" VARCHAR,
  "comments_disabled" BOOLEAN,
  "ratings_disabled" BOOLEAN,
  "video_error_or_removed" BOOLEAN,
  "description" VARCHAR
);

Frvideos

@kaggle.datasnaek_youtube_new.frvideos
  • 27.19 MB
  • 40724 rows
  • 16 columns

CREATE TABLE frvideos (
  "video_id" VARCHAR,
  "trending_date" VARCHAR,
  "title" VARCHAR,
  "channel_title" VARCHAR,
  "category_id" BIGINT,
  "publish_time" VARCHAR,
  "tags" VARCHAR,
  "views" BIGINT,
  "likes" BIGINT,
  "dislikes" BIGINT,
  "comment_count" BIGINT,
  "thumbnail_link" VARCHAR,
  "comments_disabled" BOOLEAN,
  "ratings_disabled" BOOLEAN,
  "video_error_or_removed" BOOLEAN,
  "description" VARCHAR
);

Gbvideos

@kaggle.datasnaek_youtube_new.gbvideos
  • 18.41 MB
  • 38916 rows
  • 16 columns

CREATE TABLE gbvideos (
  "video_id" VARCHAR,
  "trending_date" VARCHAR,
  "title" VARCHAR,
  "channel_title" VARCHAR,
  "category_id" BIGINT,
  "publish_time" VARCHAR,
  "tags" VARCHAR,
  "views" BIGINT,
  "likes" BIGINT,
  "dislikes" BIGINT,
  "comment_count" BIGINT,
  "thumbnail_link" VARCHAR,
  "comments_disabled" BOOLEAN,
  "ratings_disabled" BOOLEAN,
  "video_error_or_removed" BOOLEAN,
  "description" VARCHAR
);

Invideos

@kaggle.datasnaek_youtube_new.invideos
  • 26.66 MB
  • 37352 rows
  • 16 columns

CREATE TABLE invideos (
  "video_id" VARCHAR,
  "trending_date" VARCHAR,
  "title" VARCHAR,
  "channel_title" VARCHAR,
  "category_id" BIGINT,
  "publish_time" VARCHAR,
  "tags" VARCHAR,
  "views" BIGINT,
  "likes" BIGINT,
  "dislikes" BIGINT,
  "comment_count" BIGINT,
  "thumbnail_link" VARCHAR,
  "comments_disabled" BOOLEAN,
  "ratings_disabled" BOOLEAN,
  "video_error_or_removed" BOOLEAN,
  "description" VARCHAR
);

Usvideos

@kaggle.datasnaek_youtube_new.usvideos
  • 24.47 MB
  • 40949 rows
  • 16 columns

CREATE TABLE usvideos (
  "video_id" VARCHAR,
  "trending_date" VARCHAR,
  "title" VARCHAR,
  "channel_title" VARCHAR,
  "category_id" BIGINT,
  "publish_time" VARCHAR,
  "tags" VARCHAR,
  "views" BIGINT,
  "likes" BIGINT,
  "dislikes" BIGINT,
  "comment_count" BIGINT,
  "thumbnail_link" VARCHAR,
  "comments_disabled" BOOLEAN,
  "ratings_disabled" BOOLEAN,
  "video_error_or_removed" BOOLEAN,
  "description" VARCHAR
);
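Since the six tables above share the same 16-column schema, they can be stacked into one result with a region label for cross-region comparisons. The sketch below is one possible approach, not a documented feature of the platform. It uses Python's built-in sqlite3 module as a stand-in engine, a reduced three-column schema, and made-up rows. On the hosting platform the table names would presumably need their qualified form (such as @kaggle.datasnaek_youtube_new.usvideos), and the SQL dialect may differ. The helper name build_union_query is hypothetical.

```python
import sqlite3

# Region prefixes of the six tables listed on this page.
REGIONS = ["ca", "de", "fr", "gb", "in", "us"]


def build_union_query(regions):
    """Build a UNION ALL query that tags each row with its region code."""
    parts = [
        f"SELECT '{region.upper()}' AS region, video_id, trending_date, views "
        f"FROM {region}videos"
        for region in regions
    ]
    return "\nUNION ALL\n".join(parts)


# Demo on an in-memory SQLite database with a REDUCED schema and made-up rows.
conn = sqlite3.connect(":memory:")
for region in REGIONS:
    conn.execute(
        f'CREATE TABLE {region}videos '
        '("video_id" TEXT, "trending_date" TEXT, "views" INTEGER)'
    )
conn.execute("INSERT INTO usvideos VALUES ('abc123', '17.14.11', 1000)")
conn.execute("INSERT INTO gbvideos VALUES ('abc123', '17.15.11', 1500)")

# Find every region in which the (hypothetical) video abc123 trended.
rows = conn.execute(
    f"SELECT region, views FROM ({build_union_query(REGIONS)}) "
    "WHERE video_id = 'abc123' ORDER BY region"
).fetchall()
print(rows)
```

UNION ALL is used instead of UNION so that identical rows appearing in several regional tables are not silently collapsed.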
