Data Breaches
30,000 Records of cyber-security data breaches
@kaggle.thedevastator_data_breaches_a_comprehensive_list
This dataset compiles information on data breaches from press reports, government news releases, and mainstream news articles. The list covers breaches involving the theft or compromise of 30,000 or more records, although many smaller breaches occur continually. The method used in each breach is also listed, with hacking being the most common.
Organizations of all types and sizes are susceptible to data breaches, which can have devastating consequences. This dataset can help shed light on which organizations are most at risk and how these breaches occur, so that steps can be taken to prevent them in the future.
There are many ways to use this dataset. Here are a few ideas:
- Use the data to understand which types of organizations are most commonly breached, and what methods are used most often.
- Analyze the data to see if there are any trends or patterns in when or how breaches occur.
- Use the data to create a visualization or infographic showing the prevalence of data breaches.
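The first two ideas can be sketched with the Python standard library alone. This is a minimal sketch, assuming the header row of df_1.csv uses the column names listed for that file in this card (`Organization type`, `Method`, `Year`). The helper names `load_rows` and `count_by` are made up for illustration, and the inline rows are placeholders, not real records from the dataset.

```python
import csv
import io
from collections import Counter


def load_rows(handle):
    """Read CSV rows from an open text file into a list of dicts."""
    return list(csv.DictReader(handle))


def count_by(rows, column):
    """Count rows per distinct value of `column`, ignoring blank values."""
    return Counter(
        row[column].strip() for row in rows if (row.get(column) or "").strip()
    )


# Placeholder rows for illustration only; these are NOT records from the dataset.
SAMPLE_CSV = """Entity,Year,Records,Organization type,Method,Sources
Example Org A,2015,50000,healthcare,hacked,[1]
Example Org B,2016,120000,retail,hacked,[2]
Example Org C,2016,40000,healthcare,lost / stolen media,[3]
"""

rows = load_rows(io.StringIO(SAMPLE_CSV))
by_type = count_by(rows, "Organization type")   # which organization types appear most
by_method = count_by(rows, "Method")            # which breach methods appear most
by_year = count_by(rows, "Year")                # breaches per year, keyed by the raw text value

print(by_type.most_common())
print(by_method.most_common())
print(sorted(by_year.items()))
```

To run this against the real file, replace `io.StringIO(SAMPLE_CSV)` with an open file handle such as `open("df_1.csv", newline="", encoding="utf-8")`; the local path and encoding are assumptions.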
This dataset can be used to identify trends in data breaches in terms of methods used, types of organizations breached, and geographical distribution.
This dataset can be used to study the effect of data breaches on organizational reputation and customer trust.
This dataset can be used by organizations to benchmark their own security measures against those of similar organizations that have experienced data breaches.
License
> License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
> No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.
File: df_1.csv
| Column name | Description |
|---|---|
| Entity | The name of the organization that was breached. (String) |
| Year | The year when the breach occurred. (Integer) |
| Records | The number of records that were compromised in the breach. (Integer) |
| Organization type | The type of organization that was breached. (String) |
| Method | The method that was used to breach the organization. (String) |
| Sources | The sources from which the data was collected. (String) |
CREATE TABLE df_1 (
"unnamed_0" BIGINT, -- Unnamed: 0
"entity" VARCHAR,
"year" VARCHAR,
"records" VARCHAR,
"organization_type" VARCHAR,
"method" VARCHAR,
"sources" VARCHAR
);
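
The schema stores `year` and `records` as VARCHAR, while the column table describes both as integers, so values may need converting before any numeric analysis. The following is a minimal sketch of a tolerant conversion. It assumes, without confirmation from the source, that some values may contain thousands separators or non-numeric text; `parse_int` is a hypothetical helper name.

```python
def parse_int(text):
    """Return `text` as an int, or None if it is not a plain whole number.

    Assumption: commas and surrounding whitespace are formatting only.
    """
    if text is None:
        return None
    cleaned = text.replace(",", "").strip()
    # Accept ASCII digits only; anything else (words, ranges, blanks) maps to None.
    if cleaned.isascii() and cleaned.isdigit():
        return int(cleaned)
    return None


# Illustrative inputs only; these strings are not taken from the dataset.
examples = ["2019", "1,500,000", " 42 ", "unknown", "2014 and 2015", ""]
parsed = [parse_int(value) for value in examples]
print(parsed)
```

Returning `None` instead of raising keeps unparseable rows visible, so they can be counted or inspected rather than silently dropped.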