Predicting The Past: Titanic Survival With ML
Supervised Learning Dataset for Binary Classification Using Real Historical Data
@kaggle.abhishekkkumark_titanic_survival_analysis_dataset
The Titanic Passenger Survival Prediction Dataset is one of the best-known beginner datasets for learning machine learning and data science. It is based on real data from the sinking of the RMS Titanic on April 15, 1912, in which more than 1,500 of the approximately 2,224 passengers and crew died.
The dataset is suited to practicing:
Exploratory Data Analysis (EDA)
Data cleaning and feature engineering
Model training, validation, and prediction
Evaluation metrics for classification problems
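As an illustration of the last item, the sketch below computes accuracy, precision, recall, and F1 for 0/1 labels such as the `survived` column. The helper name `binary_metrics` and the example labels are hypothetical and made up for illustration; they do not come from the dataset or from any library.

```python
def binary_metrics(y_true, y_pred):
    """Return accuracy, precision, recall and F1 for lists of 0/1 labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if len(y_true) == 0:
        raise ValueError("inputs must not be empty")

    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives

    accuracy = (tp + tn) / len(pairs)
    # Fall back to 0.0 when a denominator is zero (no predicted or actual positives).
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Made-up labels, not taken from the dataset.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
metrics = binary_metrics(y_true, y_pred)
print(metrics)
```

Accuracy alone can mislead when the classes are imbalanced, which is why precision and recall are reported alongside it.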
CREATE TABLE gender_analysis_dataset (
"passengerid" BIGINT,
"survived" BIGINT
);
CREATE TABLE test (
"passengerid" BIGINT,
"pclass" BIGINT,
"name" VARCHAR,
"sex" VARCHAR,
"age" DOUBLE,
"sibsp" BIGINT,
"parch" BIGINT,
"ticket" VARCHAR,
"fare" DOUBLE,
"cabin" VARCHAR,
"embarked" VARCHAR
);
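The schema alone does not say which columns contain missing values, but nullable columns such as `age` are a common target for cleaning. The following is a minimal sketch, assuming rows are held as Python dictionaries keyed by the column names above and that missing ages appear as `None`. The function `clean_rows`, the derived fields `is_female` and `family_size`, the `"male"`/`"female"` encoding, and the sample rows are all illustrative assumptions.

```python
from statistics import median


def clean_rows(rows):
    """Return new rows with missing ages imputed and two derived features."""
    known_ages = [r["age"] for r in rows if r["age"] is not None]
    # Median of the observed ages; None if every age is missing.
    fill_age = median(known_ages) if known_ages else None

    cleaned = []
    for r in rows:
        out = dict(r)  # copy so the input rows are left untouched
        if out["age"] is None:
            out["age"] = fill_age
        # Assumes the sex column uses the strings "male" / "female".
        out["is_female"] = 1 if out["sex"] == "female" else 0
        # Siblings/spouses + parents/children + the passenger.
        out["family_size"] = out["sibsp"] + out["parch"] + 1
        cleaned.append(out)
    return cleaned


# Made-up rows for illustration only.
sample = [
    {"passengerid": 1, "sex": "male", "age": 30.0, "sibsp": 1, "parch": 0},
    {"passengerid": 2, "sex": "female", "age": None, "sibsp": 0, "parch": 2},
    {"passengerid": 3, "sex": "female", "age": 40.0, "sibsp": 0, "parch": 0},
]
cleaned = clean_rows(sample)
print(cleaned[1])
```

When working with both tables, the fill value would normally be computed from `train` and reused for `test`, so that no information from the test rows leaks into preprocessing.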
CREATE TABLE train (
"passengerid" BIGINT,
"survived" BIGINT,
"pclass" BIGINT,
"name" VARCHAR,
"sex" VARCHAR,
"age" DOUBLE,
"sibsp" BIGINT,
"parch" BIGINT,
"ticket" VARCHAR,
"fare" DOUBLE,
"cabin" VARCHAR,
"embarked" VARCHAR
);
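A first exploratory step on the `train` table could be a grouped survival rate. The sketch below uses Python's built-in `sqlite3` module with an in-memory database as a stand-in for wherever the tables are actually hosted. The inserted rows are placeholders made up for illustration, and the `"male"`/`"female"` and `"S"` values are assumptions about how the columns are encoded.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE train (
        "passengerid" BIGINT, "survived" BIGINT, "pclass" BIGINT,
        "name" VARCHAR, "sex" VARCHAR, "age" DOUBLE, "sibsp" BIGINT,
        "parch" BIGINT, "ticket" VARCHAR, "fare" DOUBLE,
        "cabin" VARCHAR, "embarked" VARCHAR
    )
""")

# Placeholder rows, NOT real passenger records.
rows = [
    (1, 0, 3, "Passenger A", "male", 22.0, 1, 0, "T1", 7.25, None, "S"),
    (2, 1, 1, "Passenger B", "female", 38.0, 1, 0, "T2", 71.28, "C85", "S"),
    (3, 1, 3, "Passenger C", "female", 26.0, 0, 0, "T3", 7.92, None, "S"),
    (4, 0, 3, "Passenger D", "male", None, 0, 0, "T4", 8.05, None, "S"),
    (5, 1, 3, "Passenger E", "male", 30.0, 0, 0, "T5", 8.05, None, "S"),
    (6, 0, 3, "Passenger F", "female", 19.0, 0, 1, "T6", 7.85, None, "S"),
]
conn.executemany(
    "INSERT INTO train VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)", rows
)

# survived is 0/1, so its average is the survival rate of each group.
query = """
    SELECT sex, pclass, COUNT(*) AS passengers, AVG(survived) AS survival_rate
    FROM train
    GROUP BY sex, pclass
    ORDER BY sex, pclass
"""
summary = conn.execute(query).fetchall()
for sex, pclass, passengers, rate in summary:
    print(f"{sex:>6} class {pclass}: n={passengers}, survival rate={rate:.2f}")
conn.close()
```

The same `SELECT` statement should carry over to other SQL engines with little or no change, since it relies only on `COUNT`, `AVG`, and `GROUP BY`.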