Insurance Dataset
Medical_Insurance cost dataset
@kaggle.sidraaazam_insurance_dataset
Medical_Insurance cost dataset
@kaggle.sidraaazam_insurance_dataset
The Medical Insurance Cost dataset contains information about individuals’ demographic, lifestyle, and health-related factors, along with their corresponding medical insurance charges. It is typically used for predictive modeling, statistical analysis, and machine learning tasks such as regression.
Common Columns in the Dataset
Age Age of the insured person (in years).
Sex Gender of the insured individual (male/female).
BMI Body Mass Index, a measure of body fat based on height and weight.
Children Number of dependents covered by the insurance (e.g., 0, 1, 2, etc.).
Smoker Smoking status of the person (yes/no).
Region Residential area of the insured (e.g., northeast, northwest, southeast, southwest).
Charges Final medical insurance cost billed by the insurance company
Purpose of the Dataset
To analyze the factors influencing health insurance costs.
To build regression models predicting insurance charges.
To understand the relationship between lifestyle (like smoking, BMI) and medical expenses.
Useful for actuarial science, healthcare analytics, and machine learning projects.**
CREATE TABLE insurance (
"age" BIGINT,
"sex" VARCHAR,
"bmi" DOUBLE,
"children" BIGINT,
"smoker" VARCHAR,
"region" VARCHAR,
"charges" DOUBLE
);
Anyone who has the link will be able to view this.