Baselight

Data Breaches

30,000 Records of cyber-security data breaches

@kaggle.thedevastator_data_breaches_a_comprehensive_list

Loading...
Loading...

About this Dataset

Data Breaches

Data Breaches Dataset

30,000 Records of cyber-security data breaches


About this dataset

This dataset is a compilation of data from various sources detailing data breaches. These sources include press reports, government news releases, and mainstream news articles. The list includes those involving the theft or compromise of 30,000 or more records, although many smaller breaches occur continually. In addition, the various methods used in the breaches are listed, with hacking being the most common.

Organizations of all types and sizes are susceptible to data breaches, which can have devastating consequences. This dataset can help shed light on which organizations are most at risk and how these breaches occur so that steps can be taken to prevent them in the future

How to use the dataset

There are many ways to use this dataset. Here are a few ideas:

  • Use the data to understand which types of organizations are most commonly breached, and what methods are used most often.
  • Analyze the data to see if there are any trends or patterns in when or how breaches occur.
  • Use the data to create a visualizations or infographic showing the prevalence of data breaches

Research Ideas

  • This dataset can be used to identify trends in data breaches in terms of methods used, types of organizations breached, and geographical distribution.

  • This dataset can be used to study the effect of data breaches on organizational reputation and customer trust.

  • This dataset can be used by organizations to benchmark their own security measures against those of similar organizations that have experienced data breaches

Acknowledgements

License

> License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
> No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

Columns

File: df_1.csv

Column name Description
Entity The name of the organization that was breached. (String)
Year The year when the breach occurred. (Integer)
Records The number of records that were compromised in the breach. (Integer)
Organization type The type of organization that was breached. (String)
Method The method that was used to breach the organization. (String)
Sources The sources from which the data was collected. (String)

Tables

Df 1

@kaggle.thedevastator_data_breaches_a_comprehensive_list.df_1
  • 19.28 KB
  • 352 rows
  • 7 columns
Loading...

CREATE TABLE df_1 (
  "unnamed_0" BIGINT,
  "entity" VARCHAR,
  "year" VARCHAR,
  "records" VARCHAR,
  "organization_type" VARCHAR,
  "method" VARCHAR,
  "sources" VARCHAR
);

Share link

Anyone who has the link will be able to view this.