Baselight

TIMDB - Bollywood Films

A Data-Driven Approach to Bollywood

@kaggle.thedevastator_timdb_bollywood_films

About this Dataset

TIMDB - Bollywood Films

TIMDB - Bollywood Films

A Data-Driven Approach to Bollywood


About this dataset

The Indian Movie Database (IMDB) is a data-driven approach to understanding and studying Bollywood, the Hindi-language film industry based in Mumbai, India. The IMDB offers a comprehensive and up-to-date dataset of both contemporary and classic Bollywood films, complete with movie information, ratings, text data, and more.

This dataset provides a wealth of information on Bollywood movies that can be used to understand the industry and its workings. With over 10,000 movies in the database, there is something for everyone. Whether you are looking for information on a specific movie or wanting to explore the genre trends of Bollywood over time, this dataset has you covered

How to use the dataset

This dataset contains information on over 10,000 Bollywood films, including title, release date, runtime, genres, ratings, and more. The data is divided into four csv files: bollywood.csv, bollywood_meta.csv, bollywood_ratings.csv, and bollywood_text.csv.

The bollywood.csv file contains information on contemporary and classic Bollywood films, including movie title, IMDB ID

Research Ideas

  • Identifying emerging Bollywood stars and predicting their future success
  • Analyzing the relationship between movie ratings and box office performance
  • Studying the impact of social media on Bollywood movie viewership

Acknowledgements

The Indian Movie Database (IMDB) is a data-driven approach to understanding and studying Bollywood, the Hindi-language film industry based in Mumbai, India. The IMDB offers a comprehensive and up-to-date dataset of both contemporary and classic Bollywood films, complete with movie information, ratings, text data, and more.

The Indian Movie Database (IMDB) was created by Rakesh Jain and Tushar Aneja. Jain is a Computer Science graduate from IIT Roorkee and Aneja is an Electrical Engineering graduate also from IIT Roorkee

License

License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

Columns

File: bollywood.csv

Column name Description
title The title of the movie. (String)
poster_path The path to the movie poster. (String)
wiki_link The Wikipedia link for the movie. (String)

File: bollywood_meta.csv

Column name Description
title The title of the movie. (String)
original_title The original title of the movie. (String)
is_adult Whether or not the movie is suitable for adults. (Boolean)
year_of_release The year the movie was released. (Integer)
runtime The runtime of the movie in minutes. (Integer)
genres The genres the movie belongs to. (String)

File: bollywood_ratings.csv

Column name Description
imdb_rating The rating of the movie on IMDB. (Float)
imdb_votes The number of votes the movie has received on IMDB. (Integer)

File: bollywood_text.csv

Column name Description
story The story of the film. (Text)
summary A brief summary of the film. (Text)
tagline The tagline of the film. (Text)
actors The actors in the film. (Text)
wins_nominations The film's awards and nominations. (Text)
release_date The release date of the film. (Date)

File: bollywood_crew.csv

Column name Description
directors The directors of the movie. (String)
writers The writers of the movie. (String)

File: bollywood_crew_data.csv

Column name Description
name The name of the actor or actress. (String)
born_year The year the actor or actress was born. (Integer)
death_year The year the actor or actress died. (Integer)
profession The profession of the actor or actress. (String)
known_for The actor or actress's known for roles. (String)

File: bollywood_full.csv

Column name Description
poster_path The path to the movie poster. (String)
wiki_link The Wikipedia link for the movie. (String)
original_title The original title of the movie. (String)
is_adult Whether or not the movie is suitable for adults. (Boolean)
year_of_release The year the movie was released. (Integer)
runtime The runtime of the movie in minutes. (Integer)
genres The genres the movie belongs to. (String)
imdb_rating The rating of the movie on IMDB. (Float)
imdb_votes The number of votes the movie has received on IMDB. (Integer)
story The story of the film. (Text)
summary A brief summary of the film. (Text)
tagline The tagline of the film. (Text)
actors The actors in the film. (Text)
wins_nominations The film's awards and nominations. (Text)
release_date The release date of the film. (Date)
title_x The title of the movie. (String)
title_y The title of the movie. (String)

File: bollywood_writers_data.csv

Column name Description
name The name of the actor or actress. (String)
born_year The year the actor or actress was born. (Integer)
death_year The year the actor or actress died. (Integer)
profession The profession of the actor or actress. (String)
known_for The actor or actress's known for roles. (String)

File: genome_scores.csv

Column name Description
relevance A short description of the column. (Column Type)

File: genome_tags.csv

Column name Description
tag The name of the movie. (String)

File: links.csv


File: ratings.csv

Column name Description
rating The rating of the movie on a scale of 1 to 10. (Numeric)
timestamp The date and time when the rating was made. (DateTime)

File: tags.csv

Column name Description
tag The name of the movie. (String)
timestamp The date and time when the rating was made. (DateTime)

File: titles.csv

Column name Description
title The title of the movie. (String)

Share link

Anyone who has the link will be able to view this.