Exploration Of Dunhumby Product Transaction Data
Finding the best performing departments for developing marketing strategies
@kaggle.skoch2500_exploration_of_dunhumby_product_transaction_data
Finding the best performing departments for developing marketing strategies
@kaggle.skoch2500_exploration_of_dunhumby_product_transaction_data
This data analysis uses Hive 2.0 (IDE for SQL) to explore product data from the Dunhumby dataset. It involves the product.csv and transaction_data.csv files. The analysis involves creating views to find the most productive departments within two years and determining where marketing strategies should be implemented for future sales growth.
CREATE TABLE product (
"product_id" BIGINT,
"manufacturer" BIGINT,
"department" VARCHAR,
"brand" VARCHAR,
"commodity_desc" VARCHAR,
"sub_commodity_desc" VARCHAR,
"curr_size_of_product" VARCHAR
);
CREATE TABLE transaction_data (
"household_key" BIGINT,
"basket_id" BIGINT,
"day" BIGINT,
"product_id" BIGINT,
"quantity" BIGINT,
"sales_value" DOUBLE,
"store_id" BIGINT,
"retail_disc" DOUBLE,
"trans_time" BIGINT,
"week_no" BIGINT,
"coupon_disc" DOUBLE,
"coupon_match_disc" DOUBLE
);
Anyone who has the link will be able to view this.