Baselight

Insurance Data For Machine Learning

An Insurance Dataset for Predicting Health Insurance Premiums in the US: A Study

@kaggle.sridharstreaks_insurance_data_for_machine_learning

Insurance Dataset
@kaggle.sridharstreaks_insurance_data_for_machine_learning.insurance_dataset

  • 12.96 MB
  • 1000000 rows
  • 12 columns

| age | gender | bmi | children | smoker | region | medical_history | family_medical_history | exercise_frequency | occupation | coverage_level | charges |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 46 | male | 21.45 | 5 | yes | southeast | Diabetes | nan | Never | Blue collar | Premium | 20460.307668871566 |
| 25 | female | 25.38 | 2 | yes | northwest | Diabetes | High blood pressure | Occasionally | White collar | Premium | 20390.899217642196 |
| 38 | male | 44.88 | 2 | yes | southwest | nan | High blood pressure | Occasionally | Blue collar | Premium | 20204.476301934814 |
| 25 | male | 19.89 |  | no | northwest | nan | Diabetes | Rarely | White collar | Standard | 11789.029842697417 |
| 49 | male | 38.21 | 3 | yes | northwest | Diabetes | High blood pressure | Rarely | White collar | Standard | 19268.309838159606 |
| 55 | female | 36.41 |  | yes | northeast | nan | nan | Never | Student | Basic | 11896.836612606394 |
| 64 | female | 20.12 | 2 | no | northeast | High blood pressure | High blood pressure | Never | Blue collar | Basic | 9563.65501067933 |
| 53 | male | 30.51 | 4 | no | southeast | Heart disease | High blood pressure | Rarely | Student | Standard | 15845.29372985211 |
| 40 | female | 44.93 | 2 | yes | northeast | nan | Diabetes | Occasionally | Unemployed | Basic | 14036.544128779233 |
| 22 | female | 32.13 | 5 | yes | northeast | Diabetes | nan | Never | Student | Basic | 13669.577830240827 |

CREATE TABLE insurance_dataset (
  "age" BIGINT,
  "gender" VARCHAR,
  "bmi" DOUBLE,
  "children" BIGINT,
  "smoker" VARCHAR,
  "region" VARCHAR,
  "medical_history" VARCHAR,
  "family_medical_history" VARCHAR,
  "exercise_frequency" VARCHAR,
  "occupation" VARCHAR,
  "coverage_level" VARCHAR,
  "charges" DOUBLE
);
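
As a rough illustration of how this schema might be queried, the sketch below loads it into an in-memory SQLite database using Python's standard `sqlite3` module. Several things here are assumptions rather than part of the dataset page: SQLite stands in for whatever engine actually hosts the table (it accepts the `BIGINT`, `VARCHAR`, and `DOUBLE` type names through type affinity), only four rows from the preview are inserted, the preview's `nan` is stored as SQL `NULL`, and the helper names `load_sample` and `mean_charges_by_smoker` are hypothetical.

```python
import sqlite3

# Schema copied from the CREATE TABLE statement above.
SCHEMA = """
CREATE TABLE insurance_dataset (
  "age" BIGINT,
  "gender" VARCHAR,
  "bmi" DOUBLE,
  "children" BIGINT,
  "smoker" VARCHAR,
  "region" VARCHAR,
  "medical_history" VARCHAR,
  "family_medical_history" VARCHAR,
  "exercise_frequency" VARCHAR,
  "occupation" VARCHAR,
  "coverage_level" VARCHAR,
  "charges" DOUBLE
);
"""

# Four rows taken from the preview; "nan" is mapped to None (NULL) by assumption.
SAMPLE_ROWS = [
    (46, "male", 21.45, 5, "yes", "southeast", "Diabetes", None,
     "Never", "Blue collar", "Premium", 20460.307668871566),
    (25, "female", 25.38, 2, "yes", "northwest", "Diabetes", "High blood pressure",
     "Occasionally", "White collar", "Premium", 20390.899217642196),
    (64, "female", 20.12, 2, "no", "northeast", "High blood pressure", "High blood pressure",
     "Never", "Blue collar", "Basic", 9563.65501067933),
    (53, "male", 30.51, 4, "no", "southeast", "Heart disease", "High blood pressure",
     "Rarely", "Student", "Standard", 15845.29372985211),
]


def load_sample():
    """Create an in-memory database with the schema and the sample rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany(
        "INSERT INTO insurance_dataset VALUES (?,?,?,?,?,?,?,?,?,?,?,?)",
        SAMPLE_ROWS,
    )
    return conn


def mean_charges_by_smoker(conn):
    """Return {smoker: (row_count, mean_charges)} for the loaded rows."""
    rows = conn.execute(
        "SELECT smoker, COUNT(*), AVG(charges) "
        "FROM insurance_dataset GROUP BY smoker ORDER BY smoker"
    ).fetchall()
    return {smoker: (count, mean) for smoker, count, mean in rows}


if __name__ == "__main__":
    conn = load_sample()
    for smoker, (count, mean) in mean_charges_by_smoker(conn).items():
        print(f"smoker={smoker}: n={count}, mean charges={mean:.2f}")
```

With only four rows the averages say nothing about the full 1000000-row table; the point is just to show the column types and a typical group-by over `charges`.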
