Baselight

BMW Pricing Challenge

Predict the value of a used car

@kaggle.danielkyrka_bmw_pricing_challenge

About this Dataset

BMW Pricing Challenge

Context

Estimating the value of a used car is one of the main everyday challenges in automotive business. We believe that the sales price of a car is not only based on the value of the product itself, but is also heavily influenced by things like market trends, current availability and politics.
With this challenge we hope to raise some interest in this exciting topic and also gain some insight in what the main factors are that drive the value of a used car.

Content

The data provided consists of almost 5000 real BMW cars that were sold via a b2b auction in 2018. The price shown in the table is the highest bid that was reached during the auction.

We have already done some data cleanup and filtered out cars with engine damage etc. However there may still be minor damages like scratches, but we do not have more information about that.

We have also extracted 8 criteria based on the equipment of car that we think might have a good impact on the value of a used car. These criteria have been labeled feature_1 to feature_8 and are shown in the data below.

Inspiration

We would like to find a good statistical model to describe the value of a used car depending on the basic description and the 8 provided features. The following questions are of special interest to us:

  1. How much impact does each of features have on the estimate value of the car?

  2. How does the estimated value of a car change over time? Can you detect any patterns? (e.g. the price of a convertible should be higher in summer than in winter)

  3. How big is the influence of the factors not represented in the data on the price? Or, in other words, what is the estimated variance included in your statistical model?

Share link

Anyone who has the link will be able to view this.