Baselight

Insurance Dataset

Medical Insurance Cost dataset

@kaggle.sidraaazam_insurance_dataset


About this Dataset

Insurance Dataset

The Medical Insurance Cost dataset contains information about individuals’ demographic, lifestyle, and health-related factors, along with their corresponding medical insurance charges. It is typically used for predictive modeling, statistical analysis, and machine learning tasks such as regression.

Common Columns in the Dataset

Age: Age of the insured person, in years.

Sex: Sex of the insured person (male/female).

BMI: Body Mass Index, weight in kilograms divided by the square of height in meters; a rough indicator of body fat.

Children: Number of dependents covered by the insurance (e.g., 0, 1, 2).

Smoker: Smoking status of the insured person (yes/no).

Region: Residential area of the insured person (northeast, northwest, southeast, or southwest).

Charges: Final medical insurance cost billed by the insurance company.
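
As a rough illustration of how these columns might be typed in code, the sketch below parses CSV text into records. It assumes a CSV export with a header row using the lowercase column names from the table schema; the `InsuranceRecord` class, the `parse_records` helper, and the sample rows are hypothetical and not part of the dataset.

```python
import csv
import io
from dataclasses import dataclass


@dataclass
class InsuranceRecord:
    """One row of the insurance table (hypothetical helper type)."""
    age: int
    sex: str
    bmi: float
    children: int
    smoker: bool  # "yes"/"no" in the source, converted to a boolean here
    region: str
    charges: float


def parse_records(csv_text: str) -> list[InsuranceRecord]:
    """Parse CSV text with a header row into typed records."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [
        InsuranceRecord(
            age=int(row["age"]),
            sex=row["sex"],
            bmi=float(row["bmi"]),
            children=int(row["children"]),
            smoker=row["smoker"].strip().lower() == "yes",
            region=row["region"],
            charges=float(row["charges"]),
        )
        for row in reader
    ]


# Made-up rows for illustration only; not taken from the dataset.
SAMPLE_CSV = """age,sex,bmi,children,smoker,region,charges
30,female,25.0,0,no,northeast,4000.0
45,male,31.5,2,yes,southeast,30000.0
"""

records = parse_records(SAMPLE_CSV)
print(records[1])
```

Converting `smoker` to a boolean is a design choice that makes it easy to use as a 0/1 feature later; keeping the original string would work just as well.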

Purpose of the Dataset

To analyze the factors influencing health insurance costs.

To build regression models predicting insurance charges.

To understand the relationship between lifestyle (like smoking, BMI) and medical expenses.

To support actuarial science, healthcare analytics, and machine learning projects.

Tables

Insurance

@kaggle.sidraaazam_insurance_dataset.insurance
  • 24.54 kB
  • 1,338 rows
  • 7 columns
CREATE TABLE insurance (
  "age" BIGINT,
  "sex" VARCHAR,
  "bmi" DOUBLE,
  "children" BIGINT,
  "smoker" VARCHAR,
  "region" VARCHAR,
  "charges" DOUBLE
);
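
A quick way to experiment with this schema locally is to create the table in an in-memory database and run an aggregate query. The sketch below uses SQLite only because it ships with Python; the hosting platform's own SQL engine may treat types and functions differently. The inserted rows are made up for illustration and are not values from the dataset.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Same column names and declared types as the schema above.
conn.execute(
    """
    CREATE TABLE insurance (
      "age" BIGINT,
      "sex" VARCHAR,
      "bmi" DOUBLE,
      "children" BIGINT,
      "smoker" VARCHAR,
      "region" VARCHAR,
      "charges" DOUBLE
    )
    """
)

# Made-up rows for illustration only.
sample_rows = [
    (30, "female", 25.0, 0, "no", "northeast", 4000.0),
    (52, "male", 28.0, 1, "no", "northwest", 8000.0),
    (45, "male", 31.5, 2, "yes", "southeast", 30000.0),
]
conn.executemany("INSERT INTO insurance VALUES (?, ?, ?, ?, ?, ?, ?)", sample_rows)

# Row count and average charges per smoking status.
query = """
    SELECT smoker, COUNT(*) AS n, AVG(charges) AS avg_charges
    FROM insurance
    GROUP BY smoker
    ORDER BY smoker
"""
result = conn.execute(query).fetchall()
for smoker, n, avg_charges in result:
    print(smoker, n, avg_charges)
```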
