Baselight

Top 1000 IMDB Dataset

1000 movies dataset to be cleaned

@kaggle.fernandogarciah24_top_1000_imdb_dataset

About this Dataset

Top 1000 IMDB Dataset

I came across this dataset on Kaggle while taking a Google course. I thought it would be interesting to work on because I really like movies, and it could help me learn more about data cleaning in R.
I uploaded two datasets: the original, for anyone who would like to use it, and a second one that I cleaned.

Tables

Imdb Top 1000

@kaggle.fernandogarciah24_top_1000_imdb_dataset.imdb_top_1000
  • 254.73 KB
  • 1000 rows
  • 16 columns

CREATE TABLE imdb_top_1000 (
  "poster_link" VARCHAR,
  "series_title" VARCHAR,
  "released_year" VARCHAR,
  "certificate" VARCHAR,
  "runtime" VARCHAR,
  "genre" VARCHAR,
  "imdb_rating" DOUBLE,
  "overview" VARCHAR,
  "meta_score" DOUBLE,
  "director" VARCHAR,
  "star1" VARCHAR,
  "star2" VARCHAR,
  "star3" VARCHAR,
  "star4" VARCHAR,
  "no_of_votes" BIGINT,
  "gross" DOUBLE
);

Movies Data

@kaggle.fernandogarciah24_top_1000_imdb_dataset.movies_data
  • 103.55 KB
  • 1000 rows
  • 16 columns

CREATE TABLE movies_data (
  "series_title" VARCHAR,
  "released_year" VARCHAR,
  "certificate" VARCHAR,
  "runtime" BIGINT,
  "genre" VARCHAR,
  "subgenre" VARCHAR,
  "subgenre_1" VARCHAR,
  "imdb_rating" DOUBLE,
  "meta_score" DOUBLE,
  "director" VARCHAR,
  "star1" VARCHAR,
  "star2" VARCHAR,
  "star3" VARCHAR,
  "star4" VARCHAR,
  "no_of_votes" BIGINT,
  "gross" DOUBLE
);
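Comparing the two schemas suggests what the cleaning involved: "poster_link" and "overview" are dropped, "runtime" changes from VARCHAR to BIGINT, and "genre" is split into "genre", "subgenre", and "subgenre_1". The sketch below shows one way such a transformation could be written. It is not the author's R code. The raw formats (a runtime such as "142 min" and a comma-separated genre list) are assumptions, and the helper names parse_runtime, split_genre, and clean_row are hypothetical.

```python
from typing import Optional


def parse_runtime(raw: str) -> Optional[int]:
    """Turn a string such as '142 min' into 142 (raw format is an assumption)."""
    digits = "".join(ch for ch in raw if ch.isdigit())
    return int(digits) if digits else None


def split_genre(raw: str) -> dict:
    """Split 'Crime, Drama' into genre / subgenre / subgenre_1.

    Assumes a comma-separated list; entries beyond the third are ignored.
    """
    parts = [p.strip() for p in raw.split(",") if p.strip()]
    parts += [None] * (3 - len(parts))  # pad so missing sub-genres become None
    return {"genre": parts[0], "subgenre": parts[1], "subgenre_1": parts[2]}


def clean_row(row: dict) -> dict:
    """Map one imdb_top_1000 row onto the movies_data column layout."""
    dropped = ("poster_link", "overview", "genre", "runtime")
    cleaned = {k: v for k, v in row.items() if k not in dropped}
    cleaned["runtime"] = parse_runtime(row["runtime"])
    cleaned.update(split_genre(row["genre"]))
    return cleaned


if __name__ == "__main__":
    # Illustrative row with made-up values, not taken from the dataset.
    sample = {
        "poster_link": "https://example.com/poster.jpg",
        "series_title": "Example Movie",
        "runtime": "142 min",
        "genre": "Crime, Drama",
        "overview": "An example overview.",
    }
    print(clean_row(sample))
```

Titles with fewer than three genres end up with None in the unused sub-genre columns, which would correspond to NULL values in "subgenre" and "subgenre_1".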
