Baselight

E-commerce Business Transaction

Sales transaction of a UK-based e-commerce (online retail) for one year

@kaggle.gabrielramos87_an_online_shop_business

Loading...
Loading...

About this Dataset

E-commerce Business Transaction

Context

E-commerce has become a new channel to support businesses development. Through e-commerce, businesses can get access and establish a wider market presence by providing cheaper and more efficient distribution channels for their products or services. E-commerce has also changed the way people shop and consume products and services. Many people are turning to their computers or smart devices to order goods, which can easily be delivered to their homes.

Content

This is a sales transaction data set of UK-based e-commerce (online retail) for one year. This London-based shop has been selling gifts and homewares for adults and children through the website since 2007. Their customers come from all over the world and usually make direct purchases for themselves. There are also small businesses that buy in bulk and sell to other customers through retail outlet channels.

The data set contains 500K rows and 8 columns. The following is the description of each column.

  1. TransactionNo (categorical): a six-digit unique number that defines each transaction. The letter “C” in the code indicates a cancellation.
  2. Date (numeric): the date when each transaction was generated.
  3. ProductNo (categorical): a five or six-digit unique character used to identify a specific product.
  4. Product (categorical): product/item name.
  5. Price (numeric): the price of each product per unit in pound sterling (£).
  6. Quantity (numeric): the quantity of each product per transaction. Negative values related to cancelled transactions.
  7. CustomerNo (categorical): a five-digit unique number that defines each customer.
  8. Country (categorical): name of the country where the customer resides.

There is a small percentage of order cancellation in the data set. Most of these cancellations were due to out-of-stock conditions on some products. Under this situation, customers tend to cancel an order as they want all products delivered all at once.

Inspiration

Information is a main asset of businesses nowadays. The success of a business in a competitive environment depends on its ability to acquire, store, and utilize information. Data is one of the main sources of information. Therefore, data analysis is an important activity for acquiring new and useful information. Analyze this dataset and try to answer the following questions.

  1. How was the sales trend over the months?
  2. What are the most frequently purchased products?
  3. How many products does the customer purchase in each transaction?
  4. What are the most profitable segment customers?
  5. Based on your findings, what strategy could you recommend to the business to gain more profit?

Photo by CardMapr on Unsplash

Tables

Sales Transaction V 4a

@kaggle.gabrielramos87_an_online_shop_business.sales_transaction_v_4a
  • 3.07 MB
  • 536350 rows
  • 8 columns
Loading...

CREATE TABLE sales_transaction_v_4a (
  "transactionno" VARCHAR,
  "date" TIMESTAMP,
  "productno" VARCHAR,
  "productname" VARCHAR,
  "price" DOUBLE,
  "quantity" BIGINT,
  "customerno" DOUBLE,
  "country" VARCHAR
);

Share link

Anyone who has the link will be able to view this.