Addressing Class Imbalance in Predictive Maintenance
Dataset Description
The AI4I 2020 Predictive Maintenance Dataset is imbalanced. To address this imbalance, the dataset was augmented with two techniques: SMOTE (Synthetic Minority Over-sampling Technique), a statistical method, and CTGAN (Conditional Tabular Generative Adversarial Network), a generative AI method.
This dataset contains three directories:
- The first contains the cleaned AI4I 2020 Predictive Maintenance Dataset and its train/test splits.
- The second contains X and y of the data augmented with CTGAN.
- The third contains X and y of the data augmented with SMOTE.
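For readers unfamiliar with SMOTE, the following is a minimal sketch of its core idea: each synthetic row is placed on the line segment between a minority-class row and one of its nearest minority-class neighbors. It is not the code used to build this dataset. The function name `smote_sketch`, its parameters, and the toy rows are hypothetical and chosen only for illustration. In practice a maintained library implementation would normally be used instead.

```python
import math
import random


def smote_sketch(minority, n_new, k=5, seed=0):
    """Create n_new synthetic rows by interpolating between minority-class neighbors.

    Hypothetical helper for illustration; assumes all features are numeric.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)  # cannot have more neighbors than other rows
    synthetic = []
    for _ in range(n_new):
        # Pick a random minority row as the base point.
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority rows by Euclidean distance to the base point.
        others = [j for j in range(len(minority)) if j != i]
        others.sort(key=lambda j: math.dist(base, minority[j]))
        # Choose one of the k nearest neighbors at random.
        neighbor = minority[rng.choice(others[:k])]
        # Interpolate: base + gap * (neighbor - base), with gap in [0, 1).
        gap = rng.random()
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbor)])
    return synthetic


# Toy, made-up minority rows (two numeric features), for illustration only.
toy_minority = [[1.0, 2.0], [1.5, 1.8], [1.2, 2.4], [0.9, 2.1]]
new_rows = smote_sketch(toy_minority, n_new=6, k=2)
```

Because every synthetic row is a convex combination of two real minority rows, the new rows stay inside the region already covered by the minority class. The sketch handles numeric features only and ignores categorical columns.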