Work Related Injury Data (2016-2021)
1M injury records from over 100,000 U.S. workplaces between 2016 and 2021
@kaggle.robikscube_osha_injury_data_20162021
1M injury records from over 100,000 U.S. workplaces between 2016 and 2021
@kaggle.robikscube_osha_injury_data_20162021
https://www.osha.gov/Establishment-Specific-Injury-and-Illness-Data
Per the data dictionary:
Note: Rows that appear to be missing from the publicly available data are associated with test records that were entered into and deleted from the system at various times. For example, ITA developers periodically take down the system to perform maintenance. After it is relaunched, they submit multiple test records using test accounts to ensure that the various input methods that employers can use to submit data to the system are working correctly. These records are flagged and removed from the system.
Data Element Definition
ID Unique number for each recordestablishment_name The name of the establishment reporting data.ein Employer Identification Number (EIN) is also known as Federal Tax Identification Numbercompany_name The name of the company that owns the establishment.street_address The street address of the establishment.city The city where the establishment is located.state The state where the establishment is located.zip_code The full zip code for the establishment.naics_code The North American Industry Classification System (NAICS) code which classifies an establishment’s business.industry_description Industry Descriptionsize The size of the establishment based on the maximum number of employees that worked there at any point in the year you are submitting data for.
year_filing_for The calendar year in which the injuries and illnesses reported occurredannual_average_employees Annual Average Number of Employeestotal_hours_worked Total hours worked by all employeesno_injuries_illnesses Whether the establishment had any OSHA recordable work-related injuries` or illnesses during the year.total_deaths Total number of deaths (Form 300A Field G)total_dafw_cases Total number of cases with days away from work (Form 300A Field H)total_djtr_cases Total number of cases with job transfer or restriction (Form 300A Field I)total_other_cases Total number of other recordable cases (Form 300A Field J)total_dafw_days Total number of days away from work (Form 300A Field K)total_djtr_days Total number of days of job transfer or restriction (Form 300A Field L)total_injuries Total number of injuries (Form 300A Field M(1))
total_skin_disorders Total number of skin disorders (Form 300A Field M(2))total_respiratory_conditions Total number of respiratory conditions (Form 300A Field M(3))total_poisonings Total number of poisonings (Form 300A Field M(4))total_hearing_loss Total number of hearing loss (Form 300A Field M(5))total_other_illnesses Total number of all other illnesses (Form 300A Field M(6))establishment_ID Unique number for each establishment created under an individual user accountcreated_timestamp The date and time a record was submitted to the ITAChange_reason The reason why an establishment’s injury and illness summary was changed, if applicable --CREATE TABLE ita_data_cy_2016 (
"id" BIGINT,
"company_name" VARCHAR,
"establishment_name" VARCHAR,
"ein" VARCHAR,
"street_address" VARCHAR,
"city" VARCHAR,
"state" VARCHAR,
"zip_code" BIGINT,
"naics_code" BIGINT,
"industry_description" VARCHAR,
"annual_average_employees" BIGINT,
"total_hours_worked" DOUBLE,
"no_injuries_illnesses" BIGINT,
"total_deaths" BIGINT,
"total_dafw_cases" BIGINT,
"total_djtr_cases" BIGINT,
"total_other_cases" BIGINT,
"total_dafw_days" BIGINT,
"total_djtr_days" BIGINT,
"total_injuries" BIGINT,
"total_poisonings" BIGINT,
"total_respiratory_conditions" BIGINT,
"total_skin_disorders" BIGINT,
"total_hearing_loss" BIGINT,
"total_other_illnesses" BIGINT,
"establishment_id" BIGINT,
"establishment_type" DOUBLE,
"size" BIGINT,
"year_filing_for" BIGINT,
"created_timestamp" TIMESTAMP,
"change_reason" VARCHAR
);CREATE TABLE ita_data_cy_2017 (
"id" BIGINT,
"company_name" VARCHAR,
"establishment_name" VARCHAR,
"ein" DOUBLE,
"street_address" VARCHAR,
"city" VARCHAR,
"state" VARCHAR,
"zip_code" BIGINT,
"naics_code" BIGINT,
"industry_description" VARCHAR,
"annual_average_employees" BIGINT,
"total_hours_worked" DOUBLE,
"no_injuries_illnesses" BIGINT,
"total_deaths" BIGINT,
"total_dafw_cases" BIGINT,
"total_djtr_cases" BIGINT,
"total_other_cases" BIGINT,
"total_dafw_days" BIGINT,
"total_djtr_days" BIGINT,
"total_injuries" BIGINT,
"total_poisonings" BIGINT,
"total_respiratory_conditions" BIGINT,
"total_skin_disorders" BIGINT,
"total_hearing_loss" BIGINT,
"total_other_illnesses" BIGINT,
"establishment_id" BIGINT,
"establishment_type" DOUBLE,
"size" BIGINT,
"year_filing_for" BIGINT,
"created_timestamp" TIMESTAMP,
"change_reason" VARCHAR
);CREATE TABLE ita_data_cy_2019 (
"id" DOUBLE,
"company_name" VARCHAR,
"establishment_name" VARCHAR,
"ein" DOUBLE,
"street_address" VARCHAR,
"city" VARCHAR,
"state" VARCHAR,
"zip_code" DOUBLE,
"naics_code" DOUBLE,
"industry_description" VARCHAR,
"annual_average_employees" DOUBLE,
"total_hours_worked" DOUBLE,
"no_injuries_illnesses" DOUBLE,
"total_deaths" DOUBLE,
"total_dafw_cases" BIGINT,
"total_djtr_cases" DOUBLE,
"total_other_cases" DOUBLE,
"total_dafw_days" DOUBLE,
"total_djtr_days" DOUBLE,
"total_injuries" DOUBLE,
"total_poisonings" DOUBLE,
"total_respiratory_conditions" DOUBLE,
"total_skin_disorders" DOUBLE,
"total_hearing_loss" DOUBLE,
"total_other_illnesses" DOUBLE,
"establishment_id" DOUBLE,
"establishment_type" VARCHAR,
"size" DOUBLE,
"year_filing_for" DOUBLE,
"created_timestamp" TIMESTAMP,
"change_reason" VARCHAR
);CREATE TABLE ita_data_cy_2020 (
"id" BIGINT,
"company_name" VARCHAR,
"establishment_name" VARCHAR,
"ein" DOUBLE,
"street_address" VARCHAR,
"city" VARCHAR,
"state" VARCHAR,
"zip_code" BIGINT,
"naics_code" BIGINT,
"industry_description" VARCHAR,
"annual_average_employees" BIGINT,
"total_hours_worked" BIGINT,
"no_injuries_illnesses" BIGINT,
"total_deaths" BIGINT,
"total_dafw_cases" BIGINT,
"total_djtr_cases" BIGINT,
"total_other_cases" BIGINT,
"total_dafw_days" BIGINT,
"total_djtr_days" BIGINT,
"total_injuries" BIGINT,
"total_poisonings" BIGINT,
"total_respiratory_conditions" BIGINT,
"total_skin_disorders" BIGINT,
"total_hearing_loss" BIGINT,
"total_other_illnesses" BIGINT,
"establishment_id" BIGINT,
"establishment_type" DOUBLE,
"size" BIGINT,
"year_filing_for" BIGINT,
"created_timestamp" TIMESTAMP,
"change_reason" VARCHAR
);CREATE TABLE ita_data_cy_2021_submitted_thru_3_14_2022_v3 (
"id" DOUBLE,
"company_name" VARCHAR,
"establishment_name" VARCHAR,
"ein" DOUBLE,
"street_address" VARCHAR,
"city" VARCHAR,
"state" VARCHAR,
"zip_code" BIGINT,
"naics_code" BIGINT,
"industry_description" VARCHAR,
"annual_average_employees" DOUBLE,
"total_hours_worked" DOUBLE,
"no_injuries_illnesses" DOUBLE,
"total_deaths" DOUBLE,
"total_dafw_cases" DOUBLE,
"total_djtr_cases" DOUBLE,
"total_other_cases" DOUBLE,
"total_dafw_days" DOUBLE,
"total_djtr_days" DOUBLE,
"total_injuries" DOUBLE,
"total_poisonings" DOUBLE,
"total_respiratory_conditions" DOUBLE,
"total_skin_disorders" DOUBLE,
"total_hearing_loss" DOUBLE,
"total_other_illnesses" DOUBLE,
"establishment_id" DOUBLE,
"establishment_type" DOUBLE,
"size" DOUBLE,
"year_filing_for" DOUBLE,
"created_timestamp" TIMESTAMP,
"change_reason" VARCHAR,
"delete" BIGINT
);CREATE TABLE ita_osha_combined (
"id" BIGINT,
"company_name" VARCHAR,
"establishment_name" VARCHAR,
"ein" DOUBLE,
"street_address" VARCHAR,
"city" VARCHAR,
"state" VARCHAR,
"zip_code" DOUBLE,
"naics_code" BIGINT,
"industry_description" VARCHAR,
"annual_average_employees" BIGINT,
"total_hours_worked" DOUBLE,
"no_injuries_illnesses" DOUBLE,
"total_deaths" BIGINT,
"total_dafw_cases" BIGINT,
"total_djtr_cases" BIGINT,
"total_other_cases" BIGINT,
"total_dafw_days" BIGINT,
"total_djtr_days" BIGINT,
"total_injuries" BIGINT,
"total_poisonings" BIGINT,
"total_respiratory_conditions" BIGINT,
"total_skin_disorders" BIGINT,
"total_hearing_loss" BIGINT,
"total_other_illnesses" BIGINT,
"establishment_id" BIGINT,
"establishment_type" VARCHAR,
"size" BIGINT,
"year_filing_for" BIGINT,
"created_timestamp" TIMESTAMP,
"change_reason" VARCHAR,
"source" VARCHAR,
"delete" DOUBLE
);Anyone who has the link will be able to view this.