Violent Crime in CA
Regional, County, City/Town Rates 2006-2010
By Health [source]
About this dataset
This dataset contains information on the rate of violent crime across California - its regions, counties, cities and towns. The data was collected as part of a larger effort by the Office of Health Equity to better understand public health indicators and ensure equitable outcomes for all.
The numbers reflect more than just a problem in California communities - it reflects a problem with unequal access to resources and opportunity across race, ethnicities and geographies. African Americans in California are 11 times more likely to die from assault or homicide compared to white Californians. Similarly, certain regions report higher crime rates than others at the county level- indicating underlying issues with poverty or institutionalized inequality.
Law enforcement agencies teamed up with the Federal Bureau of Investigations’ Uniform Crime Reports to collect this data table which includes details such as reported number of violent crimes (numerator), population size (denominator), rate per 1,000 population (ratex1000) confidence intervals (LL_95CI & UL_95CI ) standard errors & relative standard errors (se & rse) as well as ratios between city/town rates vs state rates (RR_city2state). Additionally, each record is classified according to region name/code and race/ethnicity code/name , giving researchers further insight into these troubling statistics at both macro and micro levels.
Armed with this information we can explore new ways identify inequitable areas and begin looking for potential solutions that combat health disparities within our communities like never before!
More Datasets
For more datasets, click here.
Featured Notebooks
- 🚨 Your notebook can be here! 🚨!
How to use the dataset
The data is presented with twenty columns providing various segments within each row including:
- Crime definition
- Race/ethnicity code
- Region code
- Geographic area identifier
- Numerator and Denominator values of population
- Standard Error and 95% Confidence Intervals
- Relatvie Standard Error (RSE) value
- Ratios related to city/towns rate to state rate
The information provided can be used for a variety of applications such as creating visualizations or developing predictive models. It is important to note that rates are expressed per 1,000 population for their respective geographic area during each period noted by the report year field within the dataset. Additionally CA_decile column may be useful in comparing counties due numerical grading system identifying a region’s percentile ranking when compared to other counties within the current year’s entire dataset as well as ratios present under RR_city2state which presents ratio comparison between city/town rate and state rate outside given geographic area have made this an extremely valuable dataset for further analysis
Research Ideas
- Developing a crime prediction and prevention program that uses machine learning models to identify criminal hotspots and direct resources to those areas
- Exploring the connection between race/ethnicity and rates of violence in California
- Creating visualizations and interactive maps to display types of violent crime across different counties within California
Acknowledgements
If you use this dataset in your research, please credit the original authors.
Data Source
License
License: Open Database License (ODbL) v1.0
- You are free to:
- Share - copy and redistribute the material in any medium or format.
- Adapt - remix, transform, and build upon the material for any purpose, even commercially.
- You must:
- Give appropriate credit - Provide a link to the license, and indicate if changes were made.
- ShareAlike - You must distribute your contributions under the same license as the original.
- Keep intact - all notices that refer to this license, including copyright notices.
- No Derivatives - If you remix, transform, or build upon the material, you may not distribute the modified material.
- No additional restrictions - You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.
Columns
File: Violent_Crime_Rate_California_2006-2010-DD.csv
File: rows.csv
Column name |
Description |
ind_definition |
Definition of the indicator. (String) |
reportyear |
Year the report was made. (Integer) |
race_eth_code |
Race/ethnicity codes. (Integer) |
race_eth_name |
Name of race/ethnicity. (String) |
geotype |
Type of geographic area. (String) |
geotypevalue |
Value of geographic area. (Integer) |
geoname |
Name of geographical location. (String) |
county_name |
Name of county. (String) |
county_fips |
County FIPS code. (Integer) |
region_name |
Name of region. (String) |
region_code |
Region code. (Integer) |
numerator |
Numerator for rate calculation. (Integer) |
denominator |
Denominator for rate calculation. (Integer) |
ratex1000 |
Violent crime count per 1,000 population. (Float) |
LL_95CI |
Lower 95% confidence interval. (Float) |
UL_95CI |
Upper 95% confidence interval. (Float) |
se |
Standard error for values for numerator/denominator measures. (Float) |
rse |
Relative standard error for measure ratex1000. (Float) |
CA_decile |
California decile index rank. (Integer) |
RR_city2state |
Ratio between city’s violent crime rate with states rate. (Float) |
Acknowledgements
If you use this dataset in your research, please credit the original authors.
If you use this dataset in your research, please credit Health.