Baselight

100,000 UK Used Car Data Set

100,000 scraped used car listings, cleaned and split into car make.

@kaggle.adityadesai13_used_car_dataset_ford_and_mercedes

Loading...
Loading...

About this Dataset

100,000 UK Used Car Data Set

If you download/use the data set I'd appreciate an up vote, cheers.

Updated

Scraped data of used cars listings. 100,000 listings, which have been separated into files corresponding to each car manufacturer. I collected the data to make a tool to predict how much my friend should sell his old car for compared to other stuff on the market, and then just extended the data set. Then made a more general car value regression model.

previous version

Picked two fairly common cars on the British market for analysis (Ford Focus and Mercedes C Class). The hope is to find info such as: when is the ideal time to sell certain cars (i.e. at what age and mileage are there significant drops in resale value). Also can make comparisons between the two, and make a classifier for a ford or Mercedes car. Can easily add more makes and models, so comment for any request e.g. if you want a big data set of all Mercedes makes and models.

Content

The cleaned data set contains information of price, transmission, mileage, fuel type, road tax, miles per gallon (mpg), and engine size. I've removed duplicate listings and cleaned the columns, but have included a notebook showing the process and the original data for anyone who wants to check/improve my work.

Inspiration

It'd be cool to have some insights and visualisations of the data. Also, am open to ideas on how to expand the data set.

Tables

Audi

@kaggle.adityadesai13_used_car_dataset_ford_and_mercedes.audi
  • 127.14 KB
  • 10668 rows
  • 9 columns
Loading...

CREATE TABLE audi (
  "model" VARCHAR,
  "year" BIGINT,
  "price" BIGINT,
  "transmission" VARCHAR,
  "mileage" BIGINT,
  "fueltype" VARCHAR,
  "tax" BIGINT,
  "mpg" DOUBLE,
  "enginesize" DOUBLE
);

Bmw

@kaggle.adityadesai13_used_car_dataset_ford_and_mercedes.bmw
  • 129.98 KB
  • 10781 rows
  • 9 columns
Loading...

CREATE TABLE bmw (
  "model" VARCHAR,
  "year" BIGINT,
  "price" BIGINT,
  "transmission" VARCHAR,
  "mileage" BIGINT,
  "fueltype" VARCHAR,
  "tax" BIGINT,
  "mpg" DOUBLE,
  "enginesize" DOUBLE
);

Cclass

@kaggle.adityadesai13_used_car_dataset_ford_and_mercedes.cclass
  • 43.81 KB
  • 3899 rows
  • 7 columns
Loading...

CREATE TABLE cclass (
  "model" VARCHAR,
  "year" BIGINT,
  "price" BIGINT,
  "transmission" VARCHAR,
  "mileage" BIGINT,
  "fueltype" VARCHAR,
  "enginesize" DOUBLE
);

Focus

@kaggle.adityadesai13_used_car_dataset_ford_and_mercedes.focus
  • 53.97 KB
  • 5454 rows
  • 7 columns
Loading...

CREATE TABLE focus (
  "model" VARCHAR,
  "year" BIGINT,
  "price" BIGINT,
  "transmission" VARCHAR,
  "mileage" BIGINT,
  "fueltype" VARCHAR,
  "enginesize" DOUBLE
);

Ford

@kaggle.adityadesai13_used_car_dataset_ford_and_mercedes.ford
  • 201.23 KB
  • 17965 rows
  • 9 columns
Loading...

CREATE TABLE ford (
  "model" VARCHAR,
  "year" BIGINT,
  "price" BIGINT,
  "transmission" VARCHAR,
  "mileage" BIGINT,
  "fueltype" VARCHAR,
  "tax" BIGINT,
  "mpg" DOUBLE,
  "enginesize" DOUBLE
);

Hyundi

@kaggle.adityadesai13_used_car_dataset_ford_and_mercedes.hyundi
  • 61.4 KB
  • 4860 rows
  • 9 columns
Loading...

CREATE TABLE hyundi (
  "model" VARCHAR,
  "year" BIGINT,
  "price" BIGINT,
  "transmission" VARCHAR,
  "mileage" BIGINT,
  "fueltype" VARCHAR,
  "tax" BIGINT,
  "mpg" DOUBLE,
  "enginesize" DOUBLE
);

Merc

@kaggle.adityadesai13_used_car_dataset_ford_and_mercedes.merc
  • 158.55 KB
  • 13119 rows
  • 9 columns
Loading...

CREATE TABLE merc (
  "model" VARCHAR,
  "year" BIGINT,
  "price" BIGINT,
  "transmission" VARCHAR,
  "mileage" BIGINT,
  "fueltype" VARCHAR,
  "tax" BIGINT,
  "mpg" DOUBLE,
  "enginesize" DOUBLE
);

Skoda

@kaggle.adityadesai13_used_car_dataset_ford_and_mercedes.skoda
  • 75.48 KB
  • 6267 rows
  • 9 columns
Loading...

CREATE TABLE skoda (
  "model" VARCHAR,
  "year" BIGINT,
  "price" BIGINT,
  "transmission" VARCHAR,
  "mileage" BIGINT,
  "fueltype" VARCHAR,
  "tax" BIGINT,
  "mpg" DOUBLE,
  "enginesize" DOUBLE
);

Toyota

@kaggle.adityadesai13_used_car_dataset_ford_and_mercedes.toyota
  • 79.01 KB
  • 6738 rows
  • 9 columns
Loading...

CREATE TABLE toyota (
  "model" VARCHAR,
  "year" BIGINT,
  "price" BIGINT,
  "transmission" VARCHAR,
  "mileage" BIGINT,
  "fueltype" VARCHAR,
  "tax" BIGINT,
  "mpg" DOUBLE,
  "enginesize" DOUBLE
);

Unclean Cclass

@kaggle.adityadesai13_used_car_dataset_ford_and_mercedes.unclean_cclass
  • 88.72 KB
  • 4006 rows
  • 11 columns
Loading...

CREATE TABLE unclean_cclass (
  "model" VARCHAR,
  "year" DOUBLE,
  "price" VARCHAR,
  "transmission" VARCHAR,
  "mileage" VARCHAR,
  "fuel_type" VARCHAR,
  "engine_size" VARCHAR,
  "mileage2" VARCHAR,
  "fuel_type2" VARCHAR,
  "engine_size2" VARCHAR,
  "reference" VARCHAR
);

Unclean Focus

@kaggle.adityadesai13_used_car_dataset_ford_and_mercedes.unclean_focus
  • 117.19 KB
  • 5604 rows
  • 11 columns
Loading...

CREATE TABLE unclean_focus (
  "model" VARCHAR,
  "year" DOUBLE,
  "price" VARCHAR,
  "transmission" VARCHAR,
  "mileage" VARCHAR,
  "fuel_type" VARCHAR,
  "engine_size" VARCHAR,
  "mileage2" DOUBLE,
  "fuel_type2" VARCHAR,
  "engine_size2" VARCHAR,
  "reference" VARCHAR
);

Vauxhall

@kaggle.adityadesai13_used_car_dataset_ford_and_mercedes.vauxhall
  • 143.17 KB
  • 13632 rows
  • 9 columns
Loading...

CREATE TABLE vauxhall (
  "model" VARCHAR,
  "year" BIGINT,
  "price" BIGINT,
  "transmission" VARCHAR,
  "mileage" BIGINT,
  "fueltype" VARCHAR,
  "tax" BIGINT,
  "mpg" DOUBLE,
  "enginesize" DOUBLE
);

Vw

@kaggle.adityadesai13_used_car_dataset_ford_and_mercedes.vw
  • 159.89 KB
  • 15157 rows
  • 9 columns
Loading...

CREATE TABLE vw (
  "model" VARCHAR,
  "year" BIGINT,
  "price" BIGINT,
  "transmission" VARCHAR,
  "mileage" BIGINT,
  "fueltype" VARCHAR,
  "tax" BIGINT,
  "mpg" DOUBLE,
  "enginesize" DOUBLE
);

Share link

Anyone who has the link will be able to view this.