Sberbank Gender Prediction
Prediction of client gender on card transactions
@kaggle.nenriki_sberbank_gender_prediction
Prediction of client gender on card transactions
@kaggle.nenriki_sberbank_gender_prediction
One of the most valuable sources of customer information is bank transaction data. In this competition, participants are asked to answer the question: is it possible to predict the gender of a client using information about outcome transactions and income on a bank card? And if so, what is the accuracy of such a prediction?
transactions.csv – transactional data on banking operations.
train set.csv – training set with client gender marking (0/1 - client gender).
codes.csv – the table contains description of MCC transaction codes.
types.csv – table contains description of transactions.
test set.csv – data set with 2656 client id that is independent of the training data set, but that follows the same probability distribution.
client_id – client is id;
datetime – transaction date (format - ordered day number hh:mm:ss - 421 06:33:15);
code – MCC transaction code;
type – transaction type;
sum – sum of transactions: "+" — accrual of funds to the client (incoming transaction), "-" — write-off of funds (outcoming transaction);
client_id – client is id;
target – client gender;
code – MCC transaction code;
code_description – MCC transaction code description;
type – transaction type;
type_description – transaction type description;
client_id – client is id;
CREATE TABLE codes (
"code" BIGINT,
"code_description" VARCHAR
);CREATE TABLE test_set (
"cl" BIGINT,
"ent" VARCHAR,
"d" VARCHAR
);CREATE TABLE train_set (
"client_id" BIGINT,
"target" BIGINT
);CREATE TABLE transactions (
"client_id" BIGINT,
"datetime" VARCHAR,
"code" BIGINT,
"type" BIGINT,
"sum" DOUBLE
);CREATE TABLE types (
"type" BIGINT,
"type_description" VARCHAR
);Anyone who has the link will be able to view this.