Balanced Predictive Maintenance Dataset

Addressing Class Imbalance in Predictive Maintenance

@kaggle.chetansmahale_balanced_predictive_maintenance_dataset

About this Dataset

The AI4I 2020 Predictive Maintenance Dataset is imbalanced. To address this class imbalance, the dataset was augmented with two techniques: a statistical one, SMOTE (Synthetic Minority Oversampling Technique), and a generative-AI one, CTGAN (Conditional Tabular Generative Adversarial Network).
This dataset contains three directories:

  1. The first contains the cleaned AI4I 2020 Predictive Maintenance Dataset, as well as its train and test splits.
  2. The second contains X and y of the data augmented with CTGAN.
  3. The third contains X and y of the data augmented with SMOTE.
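
To make the SMOTE idea concrete, here is a minimal sketch in plain Python. It is a simplified illustration of the interpolation step only, and not necessarily how the files in this dataset were produced; the function name `smote_oversample`, its parameters, and the toy feature values are all assumptions made for the example. Each synthetic row is placed at a random point on the line segment between a minority-class row and one of its k nearest minority-class neighbours.

```python
import math
import random


def smote_oversample(minority, n_new, k=5, seed=0):
    """Return n_new synthetic rows interpolated between minority rows.

    Hypothetical helper for illustration: numeric features only,
    Euclidean distance, brute-force neighbour search.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Indices of the k nearest minority neighbours of `base`.
        neighbours = sorted(
            (j for j in range(len(minority)) if j != i),
            key=lambda j: math.dist(base, minority[j]),
        )[:k]
        other = minority[rng.choice(neighbours)]
        gap = rng.random()  # position along the segment base -> other
        synthetic.append([b + gap * (o - b) for b, o in zip(base, other)])
    return synthetic


# Toy data with two numeric features per row (values are made up).
majority = [[300.0 + i * 0.1, 40.0 + (i % 7)] for i in range(20)]
minority = [[305.0, 60.0], [305.5, 62.0], [306.0, 61.0], [304.8, 63.5]]

# Generate enough synthetic failures to match the majority class size.
new_rows = smote_oversample(minority, n_new=len(majority) - len(minority), k=3)
balanced_minority = minority + new_rows
print(len(majority), len(balanced_minority))
```

In practice, oversampling of this kind is normally applied to the training split only, so that the test split still reflects the original class distribution.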