Baselight
Sign In
kaggle

Wikipedia Movie Plots

Kaggle

@kaggle.jrobischon_wikipedia_movie_plots

Loading...
Loading...

Plot descriptions for ~35,000 movies

Dataset Description

Context

Plot summary descriptions scraped from Wikipedia

Content

The dataset contains descriptions of 34,886 movies from around the world. Column descriptions are listed below:

  • Release Year - Year in which the movie was released
  • Title - Movie title
  • Origin/Ethnicity - Origin of movie (i.e. American, Bollywood, Tamil, etc.)
  • Director - Director(s)
  • Plot - Main actor and actresses
  • Genre - Movie Genre(s)
  • Wiki Page - URL of the Wikipedia page from which the plot description was scraped
  • Plot - Long form description of movie plot (WARNING: May contain spoilers!!!)

Inspiration

Content-Based Movie Recommender:
Recommend movies with plots similar to those that a user has rated highly.

Movie Plot Generator:
Generate a movie plot description based on seed input, such as director and genre

Information Retrieval:
Return a movie title based on an input plot description

Text Classification:
Predict movie genre based on plot description

Acknowledgements

This data was scraped from Wikipedia


Related Datasets

Share link

Anyone who has the link will be able to view this.