Baselight

Banking Dataset - Marketing Targets

Banking Dataset of different customers to predict if they will convert or not.

@kaggle.prakharrathi25_banking_dataset_marketing_targets

Loading...
Loading...

About this Dataset

Banking Dataset - Marketing Targets

Context

Term deposits are a major source of income for a bank. A term deposit is a cash investment held at a financial institution. Your money is invested for an agreed rate of interest over a fixed amount of time, or term. The bank has various outreach plans to sell term deposits to their customers such as email marketing, advertisements, telephonic marketing, and digital marketing.

Telephonic marketing campaigns still remain one of the most effective way to reach out to people. However, they require huge investment as large call centers are hired to actually execute these campaigns. Hence, it is crucial to identify the customers most likely to convert beforehand so that they can be specifically targeted via call.

The data is related to direct marketing campaigns (phone calls) of a Portuguese banking institution. The classification goal is to predict if the client will subscribe to a term deposit (variable y).

Content

The data is related to the direct marketing campaigns of a Portuguese banking institution. The marketing campaigns were based on phone calls. Often, more than one contact to the same client was required, in order to access if the product (bank term deposit) would be ('yes') or not ('no') subscribed by the customer or not. The data folder contains two datasets:-

  • train.csv: 45,211 rows and 18 columns ordered by date (from May 2008 to November 2010)
  • test.csv: 4521 rows and 18 columns with 10% of the examples (4521), randomly selected from train.csv

Detailed Column Descriptions

bank client data:

1 - age (numeric)
2 - job : type of job (categorical: "admin.","unknown","unemployed","management","housemaid","entrepreneur","student",
"blue-collar","self-employed","retired","technician","services")
3 - marital : marital status (categorical: "married","divorced","single"; note: "divorced" means divorced or widowed)
4 - education (categorical: "unknown","secondary","primary","tertiary")
5 - default: has credit in default? (binary: "yes","no")
6 - balance: average yearly balance, in euros (numeric)
7 - housing: has housing loan? (binary: "yes","no")
8 - loan: has personal loan? (binary: "yes","no")

related with the last contact of the current campaign:

9 - contact: contact communication type (categorical: "unknown","telephone","cellular")
10 - day: last contact day of the month (numeric)
11 - month: last contact month of year (categorical: "jan", "feb", "mar", ..., "nov", "dec")
12 - duration: last contact duration, in seconds (numeric)

other attributes:

13 - campaign: number of contacts performed during this campaign and for this client (numeric, includes last contact)
14 - pdays: number of days that passed by after the client was last contacted from a previous campaign (numeric, -1 means client was not previously contacted)
15 - previous: number of contacts performed before this campaign and for this client (numeric)
16 - poutcome: outcome of the previous marketing campaign (categorical: "unknown","other","failure","success")

Output variable (desired target):
17 - y - has the client subscribed a term deposit? (binary: "yes","no")

Missing Attribute Values: None

Citation

This dataset is publicly available for research. It has been picked up from the UCI Machine Learning with random sampling and a few additional columns.

Please add this citation if you use this dataset for any further analysis.

S. Moro, P. Cortez and P. Rita. A Data-Driven Approach to Predict the Success of Bank Telemarketing. Decision Support Systems, Elsevier, 62:22-31, June 2014

Past Usage

The full dataset was described and analyzed in:

  • S. Moro, R. Laureano and P. Cortez. Using Data Mining for Bank Direct Marketing: An Application of the CRISP-DM Methodology.
  • In P. Novais et al. (Eds.), Proceedings of the European Simulation and Modelling Conference - ESM'2011, pp. 117-121, Guimarães, Portugal, October, 2011. EUROSIS.

Acknowledgement

Created by: Paulo Cortez (Univ. Minho) and Sérgio Moro (ISCTE-IUL) @ 2012. Thanks to Berkin Kaplanoğlu for helping with the proper column descriptions.

Tables

Test

@kaggle.prakharrathi25_banking_dataset_marketing_targets.test
  • 63.02 KB
  • 4521 rows
  • 17 columns
Loading...

CREATE TABLE test (
  "age" BIGINT,
  "job" VARCHAR,
  "marital" VARCHAR,
  "education" VARCHAR,
  "default" VARCHAR,
  "balance" BIGINT,
  "housing" VARCHAR,
  "loan" VARCHAR,
  "contact" VARCHAR,
  "day" BIGINT,
  "month" VARCHAR,
  "duration" BIGINT,
  "campaign" BIGINT,
  "pdays" BIGINT,
  "previous" BIGINT,
  "poutcome" VARCHAR,
  "y" VARCHAR
);

Train

@kaggle.prakharrathi25_banking_dataset_marketing_targets.train
  • 350.12 KB
  • 45211 rows
  • 17 columns
Loading...

CREATE TABLE train (
  "age" BIGINT,
  "job" VARCHAR,
  "marital" VARCHAR,
  "education" VARCHAR,
  "default" VARCHAR,
  "balance" BIGINT,
  "housing" VARCHAR,
  "loan" VARCHAR,
  "contact" VARCHAR,
  "day" BIGINT,
  "month" VARCHAR,
  "duration" BIGINT,
  "campaign" BIGINT,
  "pdays" BIGINT,
  "previous" BIGINT,
  "poutcome" VARCHAR,
  "y" VARCHAR
);

Share link

Anyone who has the link will be able to view this.