Glass Classification
Can you correctly identify glass type?
@kaggle.uciml_glass
Can you correctly identify glass type?
@kaggle.uciml_glass
This is a Glass Identification Data Set from UCI. It contains 10 attributes including id. The response is glass type(discrete 7 values)
Attribute Information:
https://archive.ics.uci.edu/ml/datasets/Glass+Identification
Source:
Creator:
B. German
Central Research Establishment
Home Office Forensic Science Service
Aldermaston, Reading, Berkshire RG7 4PN
Donor:
Vina Spiehler, Ph.D., DABFT
Diagnostic Products Corporation
(213) 776-0180 (ext 3014)
Data exploration of this dataset reveals two important characteristics :
The variables are highly corelated with each other including the response variables:
So which kind of ML algorithm is most suitable for this dataset Random Forest , KNN or other? Also since dataset is too small is there any chance of applying PCA or it should be completely avoided?
Highly Skewed Data:
Is scaling sufficient or are there any other techniques which should be applied to normalize data? Like BOX-COX Power transformation?
CREATE TABLE glass (
"ri" DOUBLE,
"na" DOUBLE,
"mg" DOUBLE,
"al" DOUBLE,
"si" DOUBLE,
"k" DOUBLE,
"ca" DOUBLE,
"ba" DOUBLE,
"fe" DOUBLE,
"type" BIGINT
);Anyone who has the link will be able to view this.