Dataset Photo by Nicolas J Leclercq on Unsplash
This is a collection of both critic and audience reviews for 685 different online streaming shows off of Rotten Tomatoes. The data was scraped the week of July 10, 2022. Reviews found in the files are previews in most cases, especially when the critic review is long. There are also duplicates of some reviews in the case that a show may be found on more than one online streaming service (either paid or with a subscription).
Data was collected for the top 100 most popular shows from 9 streaming platforms: 'Apple TV+', 'Paramount+', 'HBO MAX', 'Disney+', 'Prime Video', 'Hulu', 'Netflix', 'Peacock', and 'VUDU'. In most cases, there are not 100 shows that show up with reviews for each service. Why is this? Well, a lot of shows just didn't have reviews by either critics or audience members. Some services also don't have 100 shows yet due to being newer. This is the case with Apple TV+ for instance. Additionally, Rotten Tomatoes apparently goes down for maintenance a lot. This affected the collection of reviews greatly, and I am not sure if I got every single one. Oh well.
About the Files
tv_show_links.csv: This file is essentially a list of all shows whose data was scraped. Other information includes the network (or networks in some cases) a show can be found on, the percentage of critics who had positive reviews, the percentage of audience members who had positive reviews, and the Rotten Tomatoes link the reviews can be found at (mostly was used during scraping, not sure what use it would have in any data analysis).
audience_reviews.csv: This is the file that contains audience reviews. The columns are the show that is reviewed, the rating (on a scale from 0-5), and the review text.
critic_reviews.csv: The file that contains critic reviews. The columns are the reviewed show, the sentiment the critic has (1 for positive, 0 for negative), and the review text.