Baselight

Popular 10,000 Movies Listed On TMDb (latest)

Popular 10000 English movies (latest)

@kaggle.manashjyotiborah_top_10000_movies_hosted_on_tmdb

About this Dataset

Context:

These files contain TMDb data for 10,000 popular English-language movies. Key fields include cast, crew, keywords, genre, overview, popularity, vote_average, vote_count, and title.

Acknowledgements:

The cast, crew, keyword, and movie information was collected from TMDb via web scraping through its public API. Additional information about the movies is available through the same public API on the TMDb website. These datasets contain only the columns relevant to building recommendation systems.

If you need an extra feature/column, feel free to post it on the discussion tab.

Useful for:

  • Building recommendation systems using content-based, collaborative-filtering, or hybrid approaches.
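
As a rough illustration of the content-based approach, the sketch below ranks movies by the Jaccard overlap of their feature sets (for example keywords or genres). The `features` mapping, the movie ids, and the `recommend` helper are hypothetical toy values, not taken from the dataset, and Jaccard similarity is only one of several possible similarity measures.

```python
def jaccard(a: set[str], b: set[str]) -> float:
    """Jaccard similarity |a & b| / |a | b|; 0.0 when both sets are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def recommend(target_id: int, features: dict[int, set[str]], top_n: int = 3) -> list[int]:
    """Return up to top_n movie ids most similar to target_id, best first."""
    target = features[target_id]
    scored = [
        (jaccard(target, feats), movie_id)
        for movie_id, feats in features.items()
        if movie_id != target_id
    ]
    # Highest score first; ties are broken by the smaller id for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    # Movies with no overlap at all are dropped.
    return [movie_id for score, movie_id in scored[:top_n] if score > 0]


# Hypothetical toy data: movie id -> set of keywords.
features = {
    1: {"space", "alien", "action"},
    2: {"space", "alien", "horror"},
    3: {"romance", "paris"},
    4: {"space", "action", "heist"},
}

print(recommend(1, features))
```

With real data, the feature sets would be built from the keywords, genres, cast, and crew columns after parsing them.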

Please consider an UPVOTE if you found it useful.

Tables

Cast Dataset

@kaggle.manashjyotiborah_top_10000_movies_hosted_on_tmdb.cast_dataset
  • 9.74 MB
  • 10000 rows
  • 2 columns

CREATE TABLE cast_dataset (
  "id" BIGINT,
  "cast" VARCHAR
);
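
The "cast" column is a single VARCHAR per movie, so the list of cast members presumably has to be decoded before use. The sketch below assumes the cell holds either a JSON array or a Python-literal list of dictionaries; the page does not state the actual encoding, and the "name" and "character" keys in the sample cells are hypothetical.

```python
import ast
import json


def parse_list_column(raw):
    """Decode a text cell holding a serialized list; return [] if it cannot be parsed."""
    if not raw:
        return []
    # Try strict JSON first, then fall back to a Python literal (single quotes).
    for loader in (json.loads, ast.literal_eval):
        try:
            value = loader(raw)
        except (ValueError, SyntaxError, TypeError):
            continue
        if isinstance(value, list):
            return value
    return []


# Hypothetical sample cells, one per assumed encoding.
json_cell = '[{"name": "Actor A", "character": "Hero"}]'
literal_cell = "[{'name': 'Actor B', 'character': 'Villain'}]"

for cell in (json_cell, literal_cell):
    print([member["name"] for member in parse_list_column(cell)])
```

`ast.literal_eval` only evaluates literals, which makes it a safer fallback than `eval` for text of unknown origin.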

Crew Dataset

@kaggle.manashjyotiborah_top_10000_movies_hosted_on_tmdb.crew_dataset
  • 12.71 MB
  • 10000 rows
  • 2 columns

CREATE TABLE crew_dataset (
  "id" BIGINT,
  "crew" VARCHAR
);

Keywords Dataframe

@kaggle.manashjyotiborah_top_10000_movies_hosted_on_tmdb.keywords_dataframe
  • 1.19 MB
  • 10000 rows
  • 3 columns

CREATE TABLE keywords_dataframe (
  "unnamed_0" BIGINT,
  "id" BIGINT,
  "keywords" VARCHAR
);

Movies Dataset

@kaggle.manashjyotiborah_top_10000_movies_hosted_on_tmdb.movies_dataset
  • 2.21 MB
  • 10000 rows
  • 10 columns

CREATE TABLE movies_dataset (
  "genre_ids" VARCHAR,
  "id" BIGINT,
  "original_language" VARCHAR,
  "overview" VARCHAR,
  "popularity" DOUBLE,
  "release_date" TIMESTAMP,
  "title" VARCHAR,
  "vote_average" DOUBLE,
  "vote_count" BIGINT,
  "genres" VARCHAR
);
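
All four tables carry an "id" column, so they can presumably be joined on it. The sketch below assumes "id" is the same movie identifier in every table, which the page does not state explicitly. It uses an in-memory SQLite database with a reduced copy of the schemas and made-up rows, and the vote_count threshold of 500 is arbitrary.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Reduced versions of the schemas above; SQLite accepts these type names.
conn.executescript("""
CREATE TABLE movies_dataset (
  "id" BIGINT,
  "title" VARCHAR,
  "vote_average" DOUBLE,
  "vote_count" BIGINT
);
CREATE TABLE keywords_dataframe (
  "unnamed_0" BIGINT,
  "id" BIGINT,
  "keywords" VARCHAR
);
""")

# Made-up rows for illustration only.
conn.executemany(
    "INSERT INTO movies_dataset VALUES (?, ?, ?, ?)",
    [(1, "Movie A", 7.5, 1200), (2, "Movie B", 6.1, 300)],
)
conn.executemany(
    "INSERT INTO keywords_dataframe VALUES (?, ?, ?)",
    [(0, 1, "space, alien"), (1, 2, "romance")],
)

# Join movies to their keywords, keeping only movies with enough votes.
rows = conn.execute("""
SELECT m.id, m.title, k.keywords
FROM movies_dataset AS m
JOIN keywords_dataframe AS k ON k.id = m.id
WHERE m.vote_count >= 500
ORDER BY m.vote_average DESC
""").fetchall()

print(rows)
```

The "unnamed_0" column is left out of the SELECT list because it looks like a leftover row index rather than movie data, though that is a guess.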
