Baselight

Human Age Prediction Synthetic Dataset

Synthetic Data for Predicting Age Based on Health and Lifestyle Factors

@kaggle.abdullah0a_human_age_prediction_synthetic_dataset

About this Dataset

Title: Dataset for Age Prediction
Task: Age Prediction (Regression)
Description:

This dataset contains synthetic data for predicting age from health and lifestyle factors. It includes 3,000 rows and the 24 features listed below, each describing an aspect of physical health or lifestyle, plus the age target. The table schemas also include a gender column that is not described in this list.

Features:

  • Height (cm): The height of the individual in centimeters.
  • Weight (kg): The weight of the individual in kilograms.
  • Blood Pressure (s/d): Blood pressure (systolic/diastolic) in mmHg.
  • Cholesterol Level (mg/dL): Cholesterol level in milligrams per deciliter.
  • BMI: Body Mass Index, calculated from height and weight.
  • Blood Glucose Level (mg/dL): Blood glucose level in milligrams per deciliter.
  • Bone Density (g/cm²): Bone density in grams per square centimeter.
  • Vision Sharpness: Vision sharpness on a scale from 0 (blurry) to 100 (perfect).
  • Hearing Ability (dB): Hearing ability in decibels.
  • Physical Activity Level: Categorized as 'Low', 'Moderate', or 'High'.
  • Smoking Status: Categorical values including 'Never', 'Former', and 'Current'.
  • Alcohol Consumption: Frequency of alcohol consumption.
  • Diet: Type of diet, categorized as 'Balanced', 'High Protein', 'Low Carb', etc.
  • Chronic Diseases: Presence of chronic diseases (e.g., diabetes, hypertension).
  • Medication Use: Usage of medication.
  • Family History: Presence of family history of age-related conditions.
  • Cognitive Function: Self-reported cognitive function on a scale from 0 (poor) to 100 (excellent).
  • Mental Health Status: Self-reported mental health status on a scale from 0 (poor) to 100 (excellent).
  • Sleep Patterns: Average number of sleep hours per night.
  • Stress Levels: Self-reported stress levels on a scale from 0 (low) to 100 (high).
  • Pollution Exposure: Exposure to pollution measured in arbitrary units.
  • Sun Exposure: Average sun exposure in hours per week.
  • Education Level: Highest level of education attained.
  • Income Level: Annual income in USD.
  • Age (years): The target variable representing the age of the individual.
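Two of the features above have structure worth making explicit. The following is a minimal sketch, assuming BMI follows the standard definition (weight in kilograms divided by the square of height in metres) and assuming blood pressure values are strings of the form "systolic/diastolic". The function names are illustrative and are not part of the dataset.

```python
def bmi(height_cm: float, weight_kg: float) -> float:
    """Body Mass Index: weight (kg) divided by height (m) squared."""
    height_m = height_cm / 100.0
    return weight_kg / (height_m ** 2)


def split_blood_pressure(value: str) -> tuple[float, float]:
    """Split an ASSUMED 'systolic/diastolic' string into two numbers (mmHg)."""
    systolic, diastolic = value.strip().split("/")
    return float(systolic), float(diastolic)
```

Splitting the blood pressure string into two numeric columns lets a regression model use both readings directly, which a single text column does not allow.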

Use Cases:
The dataset is suited to training machine learning models that predict age from health and lifestyle factors, and to exploring relationships between health metrics and age.
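Because the task is framed as regression, one simple sanity check for any model is to compare it with a constant baseline that always predicts the mean training age. The sketch below uses mean absolute error as the metric. The metric choice, the function names, and the ages shown are assumptions made for illustration and are not taken from the dataset.

```python
from statistics import mean


def mean_absolute_error(y_true, y_pred):
    """Average absolute difference between true and predicted ages."""
    return mean(abs(t - p) for t, p in zip(y_true, y_pred))


def mean_baseline(train_ages, n_predictions):
    """Predict the mean training age for every row."""
    avg = mean(train_ages)
    return [avg] * n_predictions


# Made-up ages for illustration only; not taken from the dataset.
train_ages = [20, 40, 60]
holdout_ages = [30, 50]

baseline_preds = mean_baseline(train_ages, len(holdout_ages))
baseline_mae = mean_absolute_error(holdout_ages, baseline_preds)
print(baseline_mae)
```

A trained model is only worth keeping if its error on held-out rows is clearly below this baseline.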

Tables

Test

@kaggle.abdullah0a_human_age_prediction_synthetic_dataset.test
  • 377.44 kB
  • 3,000 rows
  • 25 columns
CREATE TABLE test (
  "gender" VARCHAR,
  "height_cm" DOUBLE,  -- Height (cm)
  "weight_kg" DOUBLE,  -- Weight (kg)
  "blood_pressure_s_d" VARCHAR,  -- Blood Pressure (s/d)
  "cholesterol_level_mg_dl" DOUBLE,  -- Cholesterol Level (mg/dL)
  "bmi" DOUBLE,
  "blood_glucose_level_mg_dl" DOUBLE,  -- Blood Glucose Level (mg/dL)
  "bone_density_g_cm" DOUBLE,  -- Bone Density (g/cm²)
  "vision_sharpness" DOUBLE,
  "hearing_ability_db" DOUBLE,  -- Hearing Ability (dB)
  "physical_activity_level" VARCHAR,
  "smoking_status" VARCHAR,
  "alcohol_consumption" VARCHAR,
  "diet" VARCHAR,
  "chronic_diseases" VARCHAR,
  "medication_use" VARCHAR,
  "family_history" VARCHAR,
  "cognitive_function" DOUBLE,
  "mental_health_status" VARCHAR,
  "sleep_patterns" VARCHAR,
  "stress_levels" DOUBLE,
  "pollution_exposure" DOUBLE,
  "sun_exposure" DOUBLE,
  "education_level" VARCHAR,
  "income_level" VARCHAR
);
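Several columns in this schema are stored as VARCHAR, so they need encoding before most regression models can use them. The sketch below maps the three physical activity levels named in the feature list ('Low', 'Moderate', 'High') to integers. The integer codes, the handling of empty values, and the function name are modelling assumptions, not something the dataset prescribes.

```python
# Ordinal codes are an assumed modelling choice; the labels come from the feature list.
ACTIVITY_ORDER = {"Low": 0, "Moderate": 1, "High": 2}


def encode_activity(level):
    """Return the ordinal code for a physical_activity_level value, or None if empty."""
    if level is None or not level.strip():
        return None
    return ACTIVITY_ORDER[level.strip()]
```

An unknown label raises a KeyError here, which surfaces unexpected category values early instead of silently encoding them.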

Train

@kaggle.abdullah0a_human_age_prediction_synthetic_dataset.train
  • 381.06 kB
  • 3,000 rows
  • 26 columns
CREATE TABLE train (
  "gender" VARCHAR,
  "height_cm" DOUBLE,  -- Height (cm)
  "weight_kg" DOUBLE,  -- Weight (kg)
  "blood_pressure_s_d" VARCHAR,  -- Blood Pressure (s/d)
  "cholesterol_level_mg_dl" DOUBLE,  -- Cholesterol Level (mg/dL)
  "bmi" DOUBLE,
  "blood_glucose_level_mg_dl" DOUBLE,  -- Blood Glucose Level (mg/dL)
  "bone_density_g_cm" DOUBLE,  -- Bone Density (g/cm²)
  "vision_sharpness" DOUBLE,
  "hearing_ability_db" DOUBLE,  -- Hearing Ability (dB)
  "physical_activity_level" VARCHAR,
  "smoking_status" VARCHAR,
  "alcohol_consumption" VARCHAR,
  "diet" VARCHAR,
  "chronic_diseases" VARCHAR,
  "medication_use" VARCHAR,
  "family_history" VARCHAR,
  "cognitive_function" DOUBLE,
  "mental_health_status" VARCHAR,
  "sleep_patterns" VARCHAR,
  "stress_levels" DOUBLE,
  "pollution_exposure" DOUBLE,
  "sun_exposure" DOUBLE,
  "education_level" VARCHAR,
  "income_level" VARCHAR,
  "age_years" BIGINT  -- Age (years)
);
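The train schema differs from the test schema only by the age_years column, which is the regression target. The sketch below separates features from the target, assuming the train table has been exported to CSV with the column names shown above. The export step, the helper name, and the sample row values are hypothetical.

```python
import csv
import io

TARGET = "age_years"  # present in train, absent in test


def split_features_target(rows):
    """Separate each row into a feature dict and an integer target age."""
    features, target = [], []
    for row in rows:
        row = dict(row)  # copy so the caller's row is not mutated
        target.append(int(row.pop(TARGET)))
        features.append(row)
    return features, target


# Made-up rows with a subset of columns, for illustration only.
sample_csv = io.StringIO(
    "gender,height_cm,weight_kg,age_years\n"
    "Male,171.1,86.2,89\n"
    "Female,160.5,58.0,34\n"
)
X, y = split_features_target(csv.DictReader(sample_csv))
```

csv.DictReader yields every value as a string, so numeric feature columns still need converting before modelling.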
