YouTube Comments Analysis on Oscar-Nominated Movie Trailers
Uncovering User-Generated Content
By PromptCloud [source]
About this dataset
This dataset provides an in-depth look into user-generated content for the Oscar-nominated movie trailers of the 2018 season. Created by PromptCloud, this data is a great starting point in understanding how people are engaging with these movies across YouTube. The data includes time stamps, comment text, likes and number of replies from Datasets collected till 6th March 2018. This can be used to create network graph visualizations and sentiment analysis as part of further study. PromptCloud used itβs internal web crawler to extract this filled with valuable insights about what conversations are taking place around these popular movies β which you can use to craft powerful targeted marketing campaigns!
More Datasets
For more datasets, click here.
Featured Notebooks
- π¨ Your notebook can be here! π¨!
How to use the dataset
How to Use this Dataset
This dataset contains user-generated content from YouTube comments on Oscar-nominated movie trailers. It includes the timestamp, comment text, likes, has replies and number of replies for each comment so you can gain insight into what viewers think about the movies. Here are some ways you can use this dataset:
-
Analyze viewer sentiment on the nominated movies by performing sentiment analysis on the collected YouTube comments.
-
Vizualize relationships between commenters by creating a social network graph based on who replied to whom.
-
Identify trends in viewership over time by plotting out viewer engagement metrics such as likes and number of replies versus time stamp of when a comment was posted
This data was collected using PromptCloud's internal web crawler and initial study has been performed including sentiment analysis and network graph visualizations which you can view [here](https://www.promptcloud
Research Ideas
- Analyzing the amount of positive/negative sentiment in comments about an Oscar-nominated movie before and after its release.
- Creating a network graph to determine the relationships between commenters in response to the movie trailer and overall opinion on it.
- Predicting how many likes and replies a comment will receive based on the content of its text, date posted, and other factors such as user demographics or platform used to post it
Acknowledgements
If you use this dataset in your research, please credit the original authors.
Data Source
License
License: Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
- You are free to:
- Share - copy and redistribute the material in any medium or format for any purpose, even commercially.
- Adapt - remix, transform, and build upon the material for any purpose, even commercially.
- You must:
- Give appropriate credit - Provide a link to the license, and indicate if changes were made.
- ShareAlike - You must distribute your contributions under the same license as the original.
Columns
File: Dunkirk.csv
Column name |
Description |
timestamp |
The time the comment was posted. (DateTime) |
comment_text |
The text of the comment. (String) |
likes |
The number of likes the comment has received. (Integer) |
has_replies |
Whether or not the comment has replies. (Boolean) |
number_of_comments |
The number of replies the comment has received. (Integer) |
File: Three Billboards Outside Ebbing, Missouri.csv
Column name |
Description |
timestamp |
The time the comment was posted. (DateTime) |
comment_text |
The text of the comment. (String) |
likes |
The number of likes the comment has received. (Integer) |
has_replies |
Whether or not the comment has replies. (Boolean) |
number_of_replies |
The number of replies the comment has received. (Integer) |
Acknowledgements
If you use this dataset in your research, please credit the original authors.
If you use this dataset in your research, please credit PromptCloud.