Baselight

Predicting The Past: Titanic Survival With ML

Supervised Learning Dataset for Binary Classification Using Real Historical Data

@kaggle.abhishekkkumark_titanic_survival_analysis_dataset

Loading...
Loading...

About this Dataset

Predicting The Past: Titanic Survival With ML

The Titanic Passenger Survival Prediction Dataset is one of the most well-known beginner datasets for learning machine learning and data science. It is based on real data collected from the tragic RMS Titanic disaster that occurred on April 15, 1912, where over 1,500 of the approximately 2,224 passengers and crew perished.
Exploratory Data Analysis (EDA)

Data cleaning and feature engineering

Model training, validation, and prediction

Evaluation metrics for classification problems

Tables

Train

@kaggle.abhishekkkumark_titanic_survival_analysis_dataset.train
  • 42.28 KB
  • 891 rows
  • 12 columns
Loading...

CREATE TABLE train (
  "passengerid" BIGINT,
  "survived" BIGINT,
  "pclass" BIGINT,
  "name" VARCHAR,
  "sex" VARCHAR,
  "age" DOUBLE,
  "sibsp" BIGINT,
  "parch" BIGINT,
  "ticket" VARCHAR,
  "fare" DOUBLE,
  "cabin" VARCHAR,
  "embarked" VARCHAR
);

Gender Analysis Dataset

@kaggle.abhishekkkumark_titanic_survival_analysis_dataset.gender_analysis_dataset
  • 4.52 KB
  • 418 rows
  • 2 columns
Loading...

CREATE TABLE gender_analysis_dataset (
  "passengerid" BIGINT,
  "survived" BIGINT
);

Test

@kaggle.abhishekkkumark_titanic_survival_analysis_dataset.test
  • 24.88 KB
  • 418 rows
  • 11 columns
Loading...

CREATE TABLE test (
  "passengerid" BIGINT,
  "pclass" BIGINT,
  "name" VARCHAR,
  "sex" VARCHAR,
  "age" DOUBLE,
  "sibsp" BIGINT,
  "parch" BIGINT,
  "ticket" VARCHAR,
  "fare" DOUBLE,
  "cabin" VARCHAR,
  "embarked" VARCHAR
);

Share link

Anyone who has the link will be able to view this.