Baselight

Netflix Data

Analyse Netflix's Shows & Movies Data with Python

@kaggle.rohitgrewal_netflix_data

Loading...
Loading...

About this Dataset

Netflix Data

📹Project Video available on YouTube - https://youtu.be/b7Kd0fLwgO4

📖 Get Python Data Analysis Self Study Notes - https://rzp.io/l/dslnotes239


This Netflix Dataset has information about the TV Shows and Movies available on Netflix.

It provides various metadata such as the type of content, cast, genres, country of origin, release details, and more. This dataset can be useful for content analysis, recommendation system development, or trend studies.

This dataset is collected from Flixable which is a third-party Netflix search engine.


Using this dataset, we answered multiple questions with Python in our Project.

Q. 1) For 'House of Cards', what is the Show Id and Who is the Director of this show ?

Q. 2) In which year the highest number of the TV Shows & Movies were released ? Show with Bar Graph.

Q. 3) How many Movies & TV Shows are in the dataset ? Show with Bar Graph.

Q. 4) Show all the Movies that were released in year 2000.

Q. 5) Show only the Titles of all TV Shows that were released in India only.

Q. 6) Show Top 10 Directors, who gave the highest number of TV Shows & Movies to Netflix ?

Q. 7) Show all the Records, where "Category is Movie and Type is Comedies" or "Country is United Kingdom".

Q. 8) In how many movies/shows, Tom Cruise was cast ?

Q. 9) What are the different Ratings defined by Netflix ?
Q. 9.1) How many Movies got the 'TV-14' rating, in Canada ?
Q. 9.2) How many TV Shows got the 'R' rating, after year 2018 ?

Q. 10) What is the maximum duration of a Movie/Show on Netflix ?

Q. 11) Which individual country has the Highest No. of TV Shows ?

Q. 12) How can we sort the dataset by Year ?

Q. 13) Find all the instances where: Category is 'Movie' and Type is 'Dramas' or Category is 'TV Show' & Type is 'Kids' TV'.


These are the main Features/Columns available in the dataset :

  • Show_Id: A unique identifier assigned to each Netflix title (e.g., s1, s2...).

  • Category: Indicates whether the content is a Movie or a TV Show.

  • Title: The name of the movie or TV show as it appears on Netflix.

  • Director: The name(s) of the director(s). This can be empty for some TV shows or content with no known director.

  • Cast: List of main actors and actresses featured in the title. It may contain multiple names, separated by commas.

  • Country: The country (or countries) where the content was produced or released.

  • Release_Date: The date on which the content was made available on Netflix.

  • Rating: The maturity rating of the content (e.g., TV-MA, PG-13, R), indicating the appropriate audience.

  • Duration: For movies, this shows the length in minutes (e.g., "93 min"). For TV shows, it displays the number of seasons (e.g., "4 Seasons").

  • Type: Genres or categories that describe the content (e.g., "Dramas", "Horror Movies", "International TV Shows").

  • Description: A short synopsis or summary of the movie or TV show.

Tables

Netflix Dataset

@kaggle.rohitgrewal_netflix_data.netflix_dataset
  • 1.68 MB
  • 7789 rows
  • 11 columns
Loading...

CREATE TABLE netflix_dataset (
  "show_id" VARCHAR,
  "category" VARCHAR,
  "title" VARCHAR,
  "director" VARCHAR,
  "cast" VARCHAR,
  "country" VARCHAR,
  "release_date" VARCHAR,
  "rating" VARCHAR,
  "duration" VARCHAR,
  "type" VARCHAR,
  "description" VARCHAR
);

Share link

Anyone who has the link will be able to view this.