Baselight

Divorce/marriage Dataset With Birth Dates

Divorce data with the birth, divorce, marriage dates and many other info

@kaggle.aagghh_divorcemarriage_dataset_with_birth_dates

About this Dataset

Divorce/marriage Dataset With Birth Dates

Context

This dataset was obtained from the https://datos.gob.mx/busca/dataset/registro-civil

This is the Mexican government official dataset for the number of divorces in the city of Xalapa, Mexico

Content

The dataset contains records of approximately 4,900+ divorces for the 15 years period (2000-2015) in the city of Xalapa, Mexico. The special good thing about this dataset is that it contains divorcees birth dates which are usually considered as being sensitive information and usually not included in the public datasets.

The dataset is originally in Spanish and I did translate all the column headers into English. Files are as follows:

divorces_2000-2015_original.csv - original dataset (in Spanish)
descriptions_for_ column.csv - descriptions for each column (in Spanish)
divorces_2000-2015_translated.csv - the version with the English translated column headers (please note that only column headers were translated)
comp_matrix.csv - the table of the zodiac signs compatibility rates that were used in my notebook (https://www.kaggle.com/aagghh/testing-the-astrology-and-zodiac-claims). Note: ignore it if you do not need it

Major features are:

Date of divorce

Birth dates for both partners (man/woman)

Nationality for both partners (man/woman)

Place of birth and residence for both partners (man/woman)

Monthly income for both partners (man/woman)

Occupation for both partners (man/woman)

Date of marriage (man/woman)

Level of education for both partners (man/woman)

Employment status for both partners (man/woman)

Number of children and their custody

Other features - please refer to the file & columns descriptions below

Inspiration

A potential use-case for this data could be a practice in classification/clustering problems in an attempt to predict a divorce.

Some other data analysis can be applied to this dataset. For instance, I am gonna use this data for the horoscope/zodiac/astrology claims validation/testing.

Please refer to my notebook on it here: https://www.kaggle.com/aagghh/is-astrology-right-testing-the-zodiac-claims

Share link

Anyone who has the link will be able to view this.