Baselight

TV Shows

A full, 3NF database that about current TV Shows (Approximately 160K Shows)

@kaggle.denizbilginn_tv_shows

About this Dataset

TV Shows

Welcome to TV Shows database, the database includes information of approximately 160K shows. The data of the database is updated in Jan 2024.

I carefully pre-processed the database as 3NF. I gather the dataset from Asaniczka's dataset and the I pre-processed it.
https://www.kaggle.com/datasets/asaniczka/full-tmdb-tv-shows-dataset-2023-150k-shows

You can use this database to research human taste tendencies, AI applications and more...

There is a ER diagram, you can use forward checking to create whole database clearly. The ER diagram created in MySQL. After creating the database, you can upload CSV tables to the SQL tables.

Interesting Task Ideas:

  1. Explore trends in TV show popularity based on vote count and average.
  2. Analyze TV show genres to identify the most popular genres or combinations of genres.
  3. Investigate the relationship between TV show ratings and the number of seasons and episodes.
  4. Build a recommendation system that suggests TV shows based on a user's favorite genres or languages.
  5. Predict the success of a TV show based on features like vote count, average, and popularity.
  6. Identify the most prolific TV show creators or production companies based on the number of shows they have created.
  7. Explore the distribution of TV show run times and investigate whether episode duration affects the overall ratings.
  8. Investigate TV show production trends across different countries and networks.
  9. Analyze the relationship between TV show language and popularity, and investigate the popularity of non-English shows.
  10. Track the status of TV shows (in production or not) and analyze their popularity over time.
  11. Develop a language analysis model to identify sentiment or themes from TV show overviews.

Please feel free to ask any questions about the data in the discussion section.