Baselight

Heart Attack In Youth Of India

Heart Attack Risk Among Young Indians: Lifestyle, Medical, and Clinical Insights

@kaggle.ankushpanday1_heart_attack_in_youth_of_india

About this Dataset

Heart Attack In Youth Of India

The dataset contains data on factors influencing heart attack likelihood among youngsters in India (ages 18-35). It captures demographic, lifestyle, medical, and clinical test information. This rich dataset is designed to help explore the connections between these factors and heart attack risk in the young population.

Key Features
Demographics

Age, Gender, Region, Urban/Rural residence, Socioeconomic Status (SES).
Lifestyle Factors

Smoking and alcohol consumption, dietary preferences, physical activity, screen time, sleep duration.
Medical History

Family history of heart disease, diabetes, hypertension, cholesterol levels, body mass index (BMI), stress levels.
Clinical and Test Results

Blood pressure, resting heart rate, ECG results, chest pain type, maximum heart rate, exercise-induced angina, blood oxygen levels (SpO2%), triglyceride levels.
Target Variable

Heart Attack Likelihood (Yes/No).

Use Cases
Machine Learning for Risk Prediction

Train classification models (e.g., logistic regression, random forests, or neural networks) to predict heart attack likelihood.
Public Health Insights

Identify high-risk groups (e.g., sedentary individuals with a family history of heart disease) for targeted interventions.
Lifestyle Recommendations

Study the impact of modifiable lifestyle factors like diet, exercise, and sleep on heart attack risk.
Regional and Socioeconomic Analysis

Explore the disparities in heart attack likelihood across regions and socioeconomic statuses in India.

Insights to Extract
Feature Interactions

Analyze non-linear relationships, such as how "Physical Activity" modifies the effect of "BMI" on heart attack risk.
Example: A high BMI may pose less risk for individuals with high physical activity.
Cluster Analysis

Perform clustering (e.g., K-Means, DBSCAN) to identify subgroups of individuals with similar risk profiles.
Latent Factors

Use dimensionality reduction techniques like PCA to uncover latent health patterns influencing heart attack risk.
Causal Relationships

Apply causal inference techniques to explore causation (e.g., does high stress directly lead to higher cholesterol levels?).
Regional Variance

Conduct geospatial analysis to study how environmental or cultural factors in different regions affect lifestyle and medical indicators.
Example Insights to Uncover
Stress and Sleep Interaction

Do high-stress individuals with less sleep show disproportionately higher risk for heart attacks?
Exercise-Induced Angina

Explore if exercise-induced angina correlates more with individuals having abnormal ECG results and high cholesterol.
Gender-Specific Risk Factors

Compare risk factor significance (e.g., smoking or BMI) between genders.
Dietary Impact

Assess the differential impact of vegetarian vs. non-vegetarian diets on triglyceride levels and heart attack risk.
Hidden Patterns

Discover rare but significant patterns (e.g., low SpO2 and normal BMI but high heart attack likelihood).
Starting Point for Insights
Correlation Analysis:
Check relationships between features and the target variable.

Predictive Modeling:
Implement a Random Forest or Gradient Boosting model and examine feature importance.

Time Series or Trends:
If extended to include time data, study trends in cholesterol and stress over time for individuals.

Share link

Anyone who has the link will be able to view this.