Dataset Description
This is a simple variant of the titanic survival Dataset.
The main difference is the presence of only the honorifics of people instead of their name. This feature should be more easy to use. Only 4 Honorifics are retained and more uncommon ones are grouped inside a new "Rare" honorific.
2 columns substitutes the Cabin column. We have the code of the first cabin in the column and the number of cabins in that column. If the Cabin is not available we insert a new cabin category "N" (Not available).
Lastly there is an indicator column for missing age values and the missing values are filled with a -1.
Open to feedback and suggestions
Related Datasets
-
Titanic
@kaggle
-
Documentos Sempre Válidos
@ptgov
-
Documento Único Automóvel
@ptgov