Wine Dataset
Normalizing Wine Data for Cleaner Insights & Better Model Performance 🍷📊
@kaggle.huzaifaansariwithpy_wine_dataset
Normalizing Wine Data for Cleaner Insights & Better Model Performance 🍷📊
@kaggle.huzaifaansariwithpy_wine_dataset
🍷 About the Wine Dataset:
The Wine dataset is a classic multiclass classification dataset from the UCI Machine Learning Repository. It contains 13 chemical properties of wines derived from three different cultivars. Features include alcohol content, malic acid, ash, magnesium, phenols, and more.
In this notebook, I explored 12 key features, visualized distributions, handled outliers, and applied MinMax Scaling to normalize the data. Scaling ensures that all features contribute equally to machine learning models, improving stability and performance.
CREATE TABLE wine_dataset (
"class" BIGINT,
"alcohol" DOUBLE,
"malic_acid" DOUBLE,
"ash" DOUBLE,
"alcalinity_of_ash" DOUBLE,
"magnesium" BIGINT,
"total_phenols" DOUBLE,
"flavanoids" DOUBLE,
"nonflavanoid_phenols" DOUBLE,
"proanthocyanins" DOUBLE,
"color_intensity" DOUBLE,
"hue" DOUBLE,
"od280_od315_of_diluted_wines" DOUBLE,
"proline" BIGINT
);
Anyone who has the link will be able to view this.