Balanced Predictive Maintenance Dataset
Addressing Class Imbalance in Predictive Maintenance
@kaggle.chetansmahale_balanced_predictive_maintenance_dataset
Addressing Class Imbalance in Predictive Maintenance
@kaggle.chetansmahale_balanced_predictive_maintenance_dataset
The AI4I 2020 Predictive Maintenance Dataset (Link) is imbalanced. To address the data imbalance challenge, this dataset was augmented using statistical technique - SMOTE (Synthetic Minority Oversampling Technique) and GenAI based technique - CTAGN (Conditional Tabular Generative Adversarial Network).
This dataset contains 3 directories.
CREATE TABLE cleaned_data (
"type" BIGINT,
"air_temperature_k" DOUBLE -- Air Temperature [K],
"process_temperature_k" DOUBLE -- Process Temperature [K],
"rotational_speed_rpm" DOUBLE -- Rotational Speed [rpm],
"torque_nm" DOUBLE -- Torque [Nm],
"tool_wear_min" DOUBLE -- Tool Wear [min],
"reason_of_failure" BIGINT
);
CREATE TABLE x_ctgan (
"type" BIGINT,
"air_temperature_k" DOUBLE -- Air Temperature [K],
"process_temperature_k" DOUBLE -- Process Temperature [K],
"rotational_speed_rpm" DOUBLE -- Rotational Speed [rpm],
"torque_nm" DOUBLE -- Torque [Nm],
"tool_wear_min" DOUBLE -- Tool Wear [min]
);
CREATE TABLE x_smote (
"type" BIGINT,
"air_temperature_k" DOUBLE -- Air Temperature [K],
"process_temperature_k" DOUBLE -- Process Temperature [K],
"rotational_speed_rpm" DOUBLE -- Rotational Speed [rpm],
"torque_nm" DOUBLE -- Torque [Nm],
"tool_wear_min" DOUBLE -- Tool Wear [min]
);
CREATE TABLE x_test (
"type" BIGINT,
"air_temperature_k" DOUBLE -- Air Temperature [K],
"process_temperature_k" DOUBLE -- Process Temperature [K],
"rotational_speed_rpm" DOUBLE -- Rotational Speed [rpm],
"torque_nm" DOUBLE -- Torque [Nm],
"tool_wear_min" DOUBLE -- Tool Wear [min]
);
CREATE TABLE x_train (
"type" BIGINT,
"air_temperature_k" DOUBLE -- Air Temperature [K],
"process_temperature_k" DOUBLE -- Process Temperature [K],
"rotational_speed_rpm" DOUBLE -- Rotational Speed [rpm],
"torque_nm" DOUBLE -- Torque [Nm],
"tool_wear_min" DOUBLE -- Tool Wear [min]
);
CREATE TABLE y_ctgan (
"reason" BIGINT,
"of" VARCHAR,
"failure" VARCHAR
);
CREATE TABLE y_smote (
"reason" BIGINT,
"of" VARCHAR,
"failure" VARCHAR
);
CREATE TABLE y_test (
"reason" BIGINT,
"of" VARCHAR,
"failure" VARCHAR
);
CREATE TABLE y_train (
"reason" BIGINT,
"of" VARCHAR,
"failure" VARCHAR
);
Anyone who has the link will be able to view this.