Predicting Profitable Customer Segments
Can you predict which customer groups are worth investing in?
@kaggle.tsiaras_predicting_profitable_customer_segments
Can you predict which customer groups are worth investing in?
@kaggle.tsiaras_predicting_profitable_customer_segments
Marketing is a key component of every modern business. Companies continuously re-invest large cuts of their profits for marketing purposes, trying to target groups of customers who have the potential to bring back the highest Return On Investment for the company. The cost of marketing can be very high though, meaning that the decision about which customer group to target is of great financial importance.
This dataset was made available by an online retail company that has collected historical data about such groups of customers, tracked the profitability of each individual group after the respective marketing campaign and retrospectively assessed whether investing on marketing spend for that group was a good choice.
In order to enable machine learning experimentation, this dataset has been structured as follows:
Each row is a comparison between two groups of potential customers:
The last column, named "target", is categorical, with 3 categories:
0 - none of the two groups were profitable
1 - group1 turned out to be more profitable
2 - group2 turned out to be more profitable
Can you build a machine learning classifier that accurately predicts which of the 2 groups (if any) will turn out to be more profitable?
CREATE TABLE customertargeting (
"g1_1" DOUBLE,
"g1_2" BIGINT,
"g1_3" BIGINT,
"g1_4" BIGINT,
"g1_5" BIGINT,
"g1_6" BIGINT,
"g1_7" BIGINT,
"g1_8" BIGINT,
"g1_9" BIGINT,
"g1_10" BIGINT,
"g1_11" BIGINT,
"g1_12" BIGINT,
"g1_13" DOUBLE,
"g1_14" DOUBLE,
"g1_15" DOUBLE,
"g1_16" DOUBLE,
"g1_17" DOUBLE,
"g1_18" DOUBLE,
"g1_19" DOUBLE,
"g1_20" DOUBLE,
"g1_21" DOUBLE,
"g2_1" DOUBLE,
"g2_2" BIGINT,
"g2_3" BIGINT,
"g2_4" BIGINT,
"g2_5" BIGINT,
"g2_6" BIGINT,
"g2_7" BIGINT,
"g2_8" BIGINT,
"g2_9" BIGINT,
"g2_10" BIGINT,
"g2_11" BIGINT,
"g2_12" BIGINT,
"g2_13" DOUBLE,
"g2_14" DOUBLE,
"g2_15" DOUBLE,
"g2_16" DOUBLE,
"g2_17" DOUBLE,
"g2_18" DOUBLE,
"g2_19" DOUBLE,
"g2_20" DOUBLE,
"g2_21" DOUBLE,
"c_1" DOUBLE,
"c_2" BIGINT,
"c_3" BIGINT,
"c_4" BIGINT,
"c_5" BIGINT,
"c_6" BIGINT,
"c_7" BIGINT,
"c_8" BIGINT,
"c_9" DOUBLE,
"c_10" BIGINT,
"c_11" BIGINT,
"c_12" BIGINT,
"c_13" BIGINT,
"c_14" BIGINT,
"c_15" BIGINT,
"c_16" BIGINT,
"c_17" BIGINT,
"c_18" BIGINT,
"c_19" BIGINT,
"c_20" DOUBLE,
"c_21" DOUBLE,
"c_22" DOUBLE,
"c_23" DOUBLE,
"c_24" BIGINT,
"c_25" DOUBLE,
"c_26" DOUBLE,
"c_27" DOUBLE,
"c_28" DOUBLE,
"target" BIGINT
);Anyone who has the link will be able to view this.