Baselight

Sberbank Gender Prediction

Prediction of client gender on card transactions

@kaggle.nenriki_sberbank_gender_prediction

About this Dataset

Sberbank Gender Prediction

Task description

One of the most valuable sources of customer information is bank transaction data. In this competition, participants are asked to answer the question: is it possible to predict the gender of a client using information about outcome transactions and income on a bank card? And if so, what is the accuracy of such a prediction?

File descriptions

transactions.csv – transactional data on banking operations.
train set.csv – training set with client gender marking (0/1 - client gender).
codes.csv – the table contains description of MCC transaction codes.
types.csv – table contains description of transactions.
test set.csv – data set with 2656 client id that is independent of the training data set, but that follows the same probability distribution.

Data fields

transactions.csv

client_id – client is id;
datetime – transaction date (format - ordered day number hh:mm:ss - 421 06:33:15);
code – MCC transaction code;
type – transaction type;
sum – sum of transactions: "+" — accrual of funds to the client (incoming transaction), "-" — write-off of funds (outcoming transaction);

train_set.csv

client_id – client is id;
target – client gender;

codes.csv

code – MCC transaction code;
code_description – MCC transaction code description;

types.csv

type – transaction type;
type_description – transaction type description;

test_set.csv

client_id – client is id;

Share link

Anyone who has the link will be able to view this.