Baselight

Scotland's Health, Housing And Crime Statistics

Exploring Multifaceted Issues with Machine Learning

@kaggle.thedevastator_scotland_s_health_housing_and_crime_statistics

Loading...
Loading...

About this Dataset

Scotland's Health, Housing And Crime Statistics


Scotland's Health, Housing and Crime Statistics

Exploring Multifaceted Issues with Machine Learning

By [source]


About this dataset

This dataset of 60 Scottish statistical indicators offers the opportunity to explore open government data with machine learning, covering 7 categories – health, social care, housing and crime and justice. Drawn from a total of 6,976 “2011 data zones” in Scotland that provide a variety of information from 2015, it delves into the intricate details of local populations to reveal potential insights. With features on crucial measures such as travel times by car and public transport, chimney fires ratio and educational attainment scores - this dataset provides a rich source of reliable statistics for use in business analysis or policy making. Uncover trends through the exploration of Scottish socio-economic conditions at both an individual and communal scale!

More Datasets

For more datasets, click here.

Featured Notebooks

  • 🚨 Your notebook can be here! 🚨!

How to use the dataset

This dataset, entitled Scotland’s Health, Housing and Crime Statistics provides an integrated set of 60 Scottish statistical indicators from seven categories - health, social care, housing, and crime and justice - derived from a total of 6,976 “2011 data zones” in Scotland.

To assist you in exploring these data points we have provided a guide on how to approach and make effective use of the information available.

  1. Read through the descriptions of each indicator carefully to understand what each measure is assessing. This will help you determine which indicators may be more relevant to your research topic or project objectives.

  2. Consider the time period covered by the dataset – this gives you an indication as to how up-to-date or recent these measures are likely to be; it also provides context for interpreting their accuracy within a given frame of reference e.g., if the measures are captured across five years then they represent changes over time rather than snapshots in one particular year only).

  3. Compare different indicators – look for correlations between variables that suggest either higher incidents or risk factors being related in some way or other; likewise bear any traditional assumptions associated with certain topics at bay as there may possibly exist divergent patterns in this analysis that challenge existing ideas about certain subject matter areas (this could offer significant insight into new insights for further research investigations).

  4. Use visualisations when exploring your data points; graphical representations can often demonstrate patterns intuitively which helps paint a broader picture around key themes within your research question(s) that might explain causation issues such as root causes etc.. .

  5. Explore regional differences too – breaking down aggregate measures into subcomponents like geography (e..g province/region/state level), gender etc., can provide potential areas where localised interventions can be implemented with greater focus moving forward according to differing needs arising across different places/groups with different characteristics associated on them (eag population health disparities exist). Furthermore when exploring regional differences you should also take temporal aspects into consideration e..g whether levels measured today vary significantly from comparable numbers observed during previous periods or vice versa)

Research Ideas

  • Identifying correlations between different statistical indicators to develop an overall health report for each data zone in Scotland. By analyzing both health and housing indicators, a comprehensive view of the living conditions in each area can be obtained, providing the government with insights for appropriate policy interventions.
  • Predictive modeling to analyze future crime trends based on existing crime data from various locations across Scotland. This could help governments plan better security measures and allocate resources more effectively in order to protect their population more efficiently.
  • Extensive use of machine learning algorithms such as clustering and classification on this dataset could give insight into whether certain indicator values are predictors of other values or not, which could then be used directly by governments when making economic policies associated with these values (e.g., housing prices). Furthermore, a comparison between actual outcomes with those predicted by models based on the dataset could be done easily to adjust policies appropriately according to real-time findings if needed

Acknowledgements

If you use this dataset in your research, please credit the original authors.
Data Source

License

License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

Columns

File: statistical_indicators.csv

Column name Description
Data Zone Unique identifier for each data zone. (String)
House Prices Average house prices in the data zone. (Numeric)
Travel times (minutes) to GP surgeries by public transport Average travel time by public transport to GP surgeries in the data zone. (Numeric)
Travel times (minutes) to post office by public transport Average travel time by public transport to post offices in the data zone. (Numeric)
Travel times (minutes) to retail centre by public transport Average travel time by public transport to retail centres in the data zone. (Numeric)
Travel times (minutes) to GP surgeries by car Average travel time by car to GP surgeries in the data zone. (Numeric)
Travel times (minutes) to petrol station by car Average travel time by car to petrol stations in the data zone. (Numeric)
Travel times (minutes) to post office by car Average travel time by car to post offices in the data zone. (Numeric)
Travel times (minutes) to primary school by car Average travel time by car to primary schools in the data zone. (Numeric)
Travel times (minutes) to secondary school by car Average travel time by car to secondary schools in the data zone. (Numeric)
Travel times (minutes) to retail centre by car Average travel time by car to retail centres in the data zone. (Numeric)
Chimney fires (ratio) Ratio of chimney fires in the data zone. (Numeric)
Dwelling fires (ratio) Ratio of dwelling fires in the data zone. (Numeric)
Other building fires (ratio) Ratio of other building fires in the data zone. (Numeric)
Other primary fires (ratio) Ratio of other primary fires in the data zone. (Numeric)
Outdoor fires (ratio) Ratio of outdoor fires in the data zone. (Numeric)
Refuse fires (ratio) Ratio of refuse fires in the data zone. (Numeric)
Vehicle fires (ratio) Ratio of vehicle fires in the data zone. (Numeric)
Educational attainment of school leavers (score) Average educational attainment score of school leavers in the data zone. (Numeric)
School attendance (ratio) Ratio of school attendance in the data zone. (Numeric)
Land area (in hectares) Total land area of the data zone. (Numeric)
Urban Rural Classification Classification of the data zone as urban or rural. (String)
Age of first time mothers 19 years and under (ratio) Ratio of first time mothers aged 19 years and under in the data zone. (Numeric)
Age of first time mothers 35 years and older (ratio) Ratio of first time mothers aged 35 years and older in the data zone. (Numeric)
Vacant households (ratio) Ratio of vacant households in the data zone. (Numeric)
Households with occupied exemptions (ratio) Ratio of households with occupied exemptions in the data zone. (Numeric)
Households with unoccupied exemptions (ratio) Ratio of households with unoccupied exemptions in the data zone. (Numeric)
Households with single adult discounts (ratio) Ratio of households with single adult discounts in the data zone. (Numeric)
Crime indicators (ratio) Ratio of crime indicators in the data zone. (Numeric)
Employment deprivation (ratio) Ratio of employment deprivation in the data zone. (Numeric)
Comparative Illness Factor Compar
Children 0-15 living in low income families (ratio) Ratio of children aged 0-15 living in low income families in the data zone. (Numeric)
Children 0-19 living in low income families (ratio) Ratio of children aged 0-19 living in low income families in the data zone. (Numeric)
Mothers currently smoking (ratio) Ratio of mothers currently smoking in the data zone. (Numeric)
Mothers former smokers (ratio) Ratio of mothers who are former smokers in the data zone. (Numeric)
Mothers never smoked (ratio) Ratio of mothers who have never smoked in the data zone. (Numeric)
Mothers not known if they smoke (ratio) Ratio
Low birthweight (less than 2500g) babies (single births) (ratio) Ratio of low birthweight babies (less than 2500g) in single births in the data zone. (Numeric)
Dwellings per hectare (ratio) Ratio of dwellings per hectare in the data zone. (Numeric)
Detached dwellings (ratio) Ratio of detached dwellings in the data zone. (Numeric)
Flats (ratio) Ratio of flats in the data zone. (Numeric)
Semi-detached dwellings (ratio) Ratio of semi-detached dwellings in the data zone. (Numeric)
Terraced dwellings (ratio) Ratio of terraced dwellings in the data zone. (
Dwellings of unknown type (ratio) Ratio of dwellings of unknown type in the data zone. (Numeric)
Long-term empty households (ratio) Ratio of long-term empty households in the data zone. (Numeric)
Occupied households (ratio) Ratio of occupied households in the data zone. (Numeric)
Second-home households (ratio) Ratio of second-home households in the data zone. (Numeric)

Acknowledgements

If you use this dataset in your research, please credit the original authors.
If you use this dataset in your research, please credit .

Tables

Statistical Indicators

@kaggle.thedevastator_scotland_s_health_housing_and_crime_statistics.statistical_indicators
  • 698.38 KB
  • 6976 rows
  • 62 columns
Loading...

CREATE TABLE statistical_indicators (
  "unnamed_0" BIGINT,
  "data_zone" VARCHAR,
  "house_prices" DOUBLE,
  "travel_times_minutes_to_gp_surgeries_by_public_transport" DOUBLE,
  "travel_times_minutes_to_post_office_by_public_transport" DOUBLE,
  "travel_times_minutes_to_retail_centre_by_public_transport" DOUBLE,
  "travel_times_minutes_to_gp_surgeries_by_car" DOUBLE,
  "travel_times_minutes_to_petrol_station_by_car" DOUBLE,
  "travel_times_minutes_to_post_office_by_car" DOUBLE,
  "travel_times_minutes_to_primary_school_by_car" DOUBLE,
  "travel_times_minutes_to_secondary_school_by_car" DOUBLE,
  "travel_times_minutes_to_retail_centre_by_car" DOUBLE,
  "chimney_fires_ratio" DOUBLE,
  "dwelling_fires_ratio" DOUBLE,
  "other_building_fires_ratio" DOUBLE,
  "other_primary_fires_ratio" DOUBLE,
  "outdoor_fires_ratio" DOUBLE,
  "refuse_fires_ratio" DOUBLE,
  "vehicle_fires_ratio" DOUBLE,
  "accidental_chimney_fires_ratio" DOUBLE,
  "accidental_dwelling_fires_ratio" DOUBLE,
  "accidental_other_building_fires_ratio" DOUBLE,
  "accidental_other_primary_fires_ratio" DOUBLE,
  "accidental_outdoor_fires_ratio" DOUBLE,
  "accidental_refuse_fires_ratio" DOUBLE,
  "accidental_vehicle_fires_ratio" DOUBLE,
  "not_accidental_chimney_fires_ratio" DOUBLE,
  "not_accidental_dwelling_fires_ratio" DOUBLE,
  "not_accidental_other_building_fires_ratio" DOUBLE,
  "not_accidental_other_primary_fires_ratio" DOUBLE,
  "not_accidental_outdoor_fires_ratio" DOUBLE,
  "not_accidental_refuse_fires_ratio" DOUBLE,
  "not_accidental_vehicle_fires_ratio" DOUBLE,
  "children_0_15_living_in_low_income_families_ratio" DOUBLE,
  "children_0_19_living_in_low_income_families_ratio" DOUBLE,
  "educational_attainment_of_school_leavers_score" DOUBLE,
  "school_attendance_ratio" DOUBLE,
  "land_area_in_hectares" DOUBLE,
  "urban_rural_classification" DOUBLE,
  "age_of_first_time_mothers_19_years_and_under_ratio" DOUBLE,
  "age_of_first_time_mothers_35_years_and_older_ratio" DOUBLE,
  "mothers_currently_smoking_ratio" DOUBLE,
  "mothers_former_smokers_ratio" DOUBLE,
  "mothers_never_smoked_ratio" DOUBLE,
  "low_birthweight_less_than_2500g_babies_single_births_ratio" DOUBLE,
  "dwellings_per_hectare_ratio" DOUBLE,
  "detached_dwellings_ratio" DOUBLE,
  "flats_ratio" DOUBLE,
  "semi_detached_dwellings_ratio" DOUBLE,
  "terraced_dwellings_ratio" DOUBLE,
  "dwellings_of_unknown_type_ratio" DOUBLE,
  "mothers_not_known_if_they_smoke_ratio" DOUBLE,
  "long_term_empty_households_ratio" DOUBLE,
  "occupied_households_ratio" DOUBLE,
  "second_home_households_ratio" DOUBLE,
  "vacant_households_ratio" DOUBLE,
  "households_with_occupied_exemptions_ratio" DOUBLE,
  "households_with_unoccupied_exemptions_ratio" DOUBLE,
  "households_with_single_adult_discounts_ratio" DOUBLE,
  "crime_indicators_ratio" DOUBLE,
  "employment_deprivation_ratio" DOUBLE,
  "comparative_illness_factor" DOUBLE
);

Share link

Anyone who has the link will be able to view this.