Baselight

Exploration Of Dunhumby Product Transaction Data

Finding the best performing departments for developing marketing strategies

@kaggle.skoch2500_exploration_of_dunhumby_product_transaction_data

About this Dataset

This analysis uses Apache Hive 2.0 (a data warehouse engine queried with SQL-like HiveQL) to explore product data from the Dunhumby dataset, drawing on the product.csv and transaction_data.csv files. It creates views to identify the best-performing departments over a two-year period and to determine where marketing strategies should be focused for future sales growth.

Tables

Product

@kaggle.skoch2500_exploration_of_dunhumby_product_transaction_data.product
  • 1.21 MB
  • 92353 rows
  • 7 columns

CREATE TABLE product (
  "product_id" BIGINT,
  "manufacturer" BIGINT,
  "department" VARCHAR,
  "brand" VARCHAR,
  "commodity_desc" VARCHAR,
  "sub_commodity_desc" VARCHAR,
  "curr_size_of_product" VARCHAR
);

Transaction Data

@kaggle.skoch2500_exploration_of_dunhumby_product_transaction_data.transaction_data
  • 20.77 MB
  • 2595732 rows
  • 12 columns

CREATE TABLE transaction_data (
  "household_key" BIGINT,
  "basket_id" BIGINT,
  "day" BIGINT,
  "product_id" BIGINT,
  "quantity" BIGINT,
  "sales_value" DOUBLE,
  "store_id" BIGINT,
  "retail_disc" DOUBLE,
  "trans_time" BIGINT,
  "week_no" BIGINT,
  "coupon_disc" DOUBLE,
  "coupon_match_disc" DOUBLE
);
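The department-level views described above can be sketched by joining the two tables on product_id and aggregating per department. The snippet below is a minimal illustration, not the original Hive code: it uses Python's built-in sqlite3 module as a stand-in for Hive, keeps only a subset of the columns, inserts a few made-up rows, and assumes that "best performing" means highest total sales_value. The view name department_sales is hypothetical.

```python
import sqlite3

# In-memory SQLite database as a stand-in for Hive (assumption for this sketch).
conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Reduced versions of the two tables; only the columns the view needs.
CREATE TABLE product (product_id INTEGER, department TEXT);
CREATE TABLE transaction_data (
    product_id INTEGER, quantity INTEGER, sales_value REAL, week_no INTEGER
);

-- Made-up sample rows, purely for illustration.
INSERT INTO product VALUES (1, 'GROCERY'), (2, 'PRODUCE'), (3, 'GROCERY');
INSERT INTO transaction_data VALUES
    (1, 2, 4.0, 1),
    (2, 1, 1.5, 1),
    (3, 1, 3.0, 2),
    (2, 3, 4.5, 60);

-- Hypothetical view: total sales and units sold per department.
CREATE VIEW department_sales AS
SELECT p.department,
       SUM(t.sales_value) AS total_sales,
       SUM(t.quantity)    AS total_quantity
FROM transaction_data AS t
JOIN product AS p ON p.product_id = t.product_id
GROUP BY p.department;
""")

# Rank departments by total sales, highest first.
rows = conn.execute(
    "SELECT department, total_sales, total_quantity "
    "FROM department_sales ORDER BY total_sales DESC"
).fetchall()

for row in rows:
    print(row)
```

The CREATE VIEW statement with a join and GROUP BY should carry over to HiveQL with little change, though type names and quoting differ between engines. Ranking by quantity instead of sales_value, or filtering on week_no to compare the two years, would be natural variations.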
