Baselight

BollyBAIT: Bollywood Clickbait Dataset

Identifying, Categorizing & Analyzing Bollywood Videos

@kaggle.thedevastator_bollybait_a_clickbait_categorization_bollywood_c

About this Dataset

BollyBAIT: A Clickbait Categorization & Bollywood Characteristics Dataset

By [source]


Welcome to BollyBAIT, a dataset designed to explore the categorization of clickbait and real/non-clickbait videos and the characteristics of Bollywood videos. This first-of-its-kind dataset covers 1000 Bollywood videos, 500 of which are clickbait and 500 of which are real/non-clickbait. Bollywood is a distinctive domain where content ranges from video interviews with celebrities to trailer launches, songs, movie trailers and social media news updates, so the dataset offers a view of how clickbait appears across these content types.

The data supports analysis of the popularity of Bollywood content, of audience attitudes and preferences towards various topics, and of the contrast between an exaggerated or misleading video and one that is truthful or informative. Market researchers can use it to identify clickbait in order to develop more effective digital advertising strategies for their campaigns, and to help protect consumers from false promises made by misleading digital messaging.

Columns: Title, Channel Title, Label

How to Use This Dataset

This dataset is a resource for data scientists, researchers, and anyone interested in exploring how clickbait and real/non-clickbait Bollywood videos are categorized. It can also offer insight into the characteristics of Bollywood videos. To use it, you need a basic understanding of, and some experience with, data manipulation software such as Excel or SPSS. You should also be familiar with basic statistical analysis, such as descriptive statistics, hypothesis testing and regression analysis.

To get started, download the dataset from Kaggle. Once it is on your local machine, open it in Excel or in a statistical package such as SPSS (IBM) or RStudio (cross-platform). The file BollyBAIT_dataset.csv contains information about each of the 1000 videos: the title, the channel title (the title of the channel that uploaded the video), the label (whether the video is clickbait or not), and details such as the video ID and URL.
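
For readers who prefer a script to a spreadsheet, the loading step can be sketched in Python with only the standard library. This is a minimal sketch under assumptions: the header names (`title`, `channel_title`, `label`) follow the table schema shown later on this page and may differ in the downloaded file, `load_rows` is a hypothetical helper, and the two sample rows are invented stand-ins for real data.

```python
import csv
import io


def load_rows(handle):
    """Read CSV rows into a list of dicts keyed by the header names."""
    return list(csv.DictReader(handle))


# Invented two-row sample standing in for BollyBAIT_dataset.csv.
# The header names are an assumption based on the table schema.
SAMPLE_CSV = """title,channel_title,label
"You won't believe what happened at this trailer launch",Example Channel A,Clickbait
"Official trailer: Example Film",Example Channel B,Non-Clickbait
"""

rows = load_rows(io.StringIO(SAMPLE_CSV))
print(len(rows), rows[0]["label"])  # prints: 2 Clickbait

# For the downloaded file, the same helper could be used like this:
# with open("BollyBAIT_dataset.csv", newline="", encoding="utf-8") as f:
#     rows = load_rows(f)
```

Each row comes back as a dictionary, so later steps can refer to columns by name instead of by position.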

After opening the file, you can start exploratory data analysis with descriptive statistics such as the mean and median, test hypotheses with t-tests or ANOVA, run regression analysis against the labels in each row, and plot charts and graphs. These steps can give deeper insight into how clickbait and real/non-clickbait Bollywood videos are categorized, along with trends in how such content is produced and used. This may take some time for a novice user, but once you are comfortable with these tools, the dataset offers many possibilities to explore.
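
As one possible starting point for the descriptive statistics mentioned above, the sketch below counts videos per label and computes the mean and median title length in words for each label. Title length is only an illustrative feature chosen here, not something the dataset description prescribes. The `summarize` helper, the dictionary keys and the sample rows are all hypothetical.

```python
from collections import Counter, defaultdict
from statistics import mean, median


def summarize(rows):
    """Return per-label row counts and title-length statistics (in words)."""
    counts = Counter(row["label"] for row in rows)
    lengths = defaultdict(list)
    for row in rows:
        lengths[row["label"]].append(len(row["title"].split()))
    stats = {
        label: {"mean_words": mean(values), "median_words": median(values)}
        for label, values in lengths.items()
    }
    return counts, stats


# Invented rows for illustration; real rows would come from the CSV file.
sample_rows = [
    {"title": "SHOCKING secret revealed at the award show", "label": "Clickbait"},
    {"title": "You will not believe this", "label": "Clickbait"},
    {"title": "Official trailer", "label": "Non-Clickbait"},
]

counts, stats = summarize(sample_rows)
print(counts)
print(stats)
```

A large gap between the two labels on a simple feature like this would be a candidate for the hypothesis tests described above.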

Research Ideas

  • Developing video categorization models and discovering underlying types of content in Bollywood videos.
  • Creating a classifier to detect whether a video is clickbait or not by using the label information provided in the dataset.
  • Analyzing domain-specific data trends, such as the popularity of certain actors, genres and movie trends, and identifying viral video trends within the Bollywood industry.
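
The classifier idea in the list above could begin with a very small baseline. The sketch below is a bag-of-words multinomial Naive Bayes with add-one smoothing, written in plain Python. It is not the dataset authors' method, only one common baseline. The `tokenize`, `train` and `predict` helpers are hypothetical, and the four training titles are invented.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def train(examples):
    """Count labels and per-label word frequencies from (title, label) pairs."""
    label_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for title, label in examples:
        label_counts[label] += 1
        for token in tokenize(title):
            word_counts[label][token] += 1
            vocab.add(token)
    return label_counts, word_counts, vocab


def predict(model, title):
    """Return the label with the highest smoothed log-probability."""
    label_counts, word_counts, vocab = model
    total = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label, n in label_counts.items():
        score = math.log(n / total)  # log prior
        denom = sum(word_counts[label].values()) + len(vocab)
        for token in tokenize(title):
            if token in vocab:  # tokens unseen in training are skipped
                score += math.log((word_counts[label][token] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Invented training titles; real training data would come from the dataset.
examples = [
    ("SHOCKING secret revealed you won't believe", "Clickbait"),
    ("You won't believe what this star did", "Clickbait"),
    ("Official trailer of the film", "Non-Clickbait"),
    ("Official song video from the film", "Non-Clickbait"),
]

model = train(examples)
print(predict(model, "You won't believe this secret"))
print(predict(model, "Official trailer released"))
```

On the real data, a held-out test split would be needed before drawing any conclusion about accuracy.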

Acknowledgements

If you use this dataset in your research, please credit the original authors.

License

License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

Columns

File: BollyBAIT_dataset.csv

Column name     Description
Title           The title of the video. (String)
Channel Title   The title of the channel that uploaded the video. (String)
Label           Indicates whether the video is clickbait or not. (Clickbait or Non-Clickbait)

Tables

Bollybait Dataset

@kaggle.thedevastator_bollybait_a_clickbait_categorization_bollywood_c.bollybait_dataset
  • 84.79 KB
  • 996 rows
  • 10 columns

CREATE TABLE bollybait_dataset (
  "title" VARCHAR,
  "video_id" VARCHAR,
  "video_url" VARCHAR,
  "channel_title" VARCHAR,
  "misleading_video" VARCHAR,
  "false_promises" VARCHAR,
  "exaggerated_video" VARCHAR,
  "spam_content" VARCHAR,
  "exploits_curiosity_gap" VARCHAR,
  "label" VARCHAR
);
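
The schema above can be tried out locally. The sketch below creates the same ten-column table in an in-memory SQLite database from Python and counts rows per label. Using SQLite is an assumption made for illustration, not necessarily the engine behind this page. The inserted rows are invented, and the columns whose value format is not documented here are left as NULL.

```python
import sqlite3

# Same column list as the CREATE TABLE statement above.
SCHEMA = """
CREATE TABLE bollybait_dataset (
  "title" VARCHAR,
  "video_id" VARCHAR,
  "video_url" VARCHAR,
  "channel_title" VARCHAR,
  "misleading_video" VARCHAR,
  "false_promises" VARCHAR,
  "exaggerated_video" VARCHAR,
  "spam_content" VARCHAR,
  "exploits_curiosity_gap" VARCHAR,
  "label" VARCHAR
);
"""

# Invented sample rows: (title, channel_title, label).
sample = [
    ("You won't believe this launch", "Example Channel A", "Clickbait"),
    ("SHOCKING news about a star", "Example Channel A", "Clickbait"),
    ("Official trailer: Example Film", "Example Channel B", "Non-Clickbait"),
]

conn = sqlite3.connect(":memory:")
conn.execute(SCHEMA)
conn.executemany(
    'INSERT INTO bollybait_dataset ("title", "channel_title", "label") '
    "VALUES (?, ?, ?)",
    sample,
)

# Count rows per label, ordered by label for a stable result.
result = conn.execute(
    'SELECT "label", COUNT(*) FROM bollybait_dataset '
    'GROUP BY "label" ORDER BY "label"'
).fetchall()
print(result)  # prints: [('Clickbait', 2), ('Non-Clickbait', 1)]
conn.close()
```

The same GROUP BY query is a quick way to check how balanced the two labels are in the 996 rows listed above.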
