100,000 UK Used Car Data Set
100,000 scraped used car listings, cleaned and split into car make.
@kaggle.adityadesai13_used_car_dataset_ford_and_mercedes
100,000 scraped used car listings, cleaned and split into car make.
@kaggle.adityadesai13_used_car_dataset_ford_and_mercedes
If you download/use the data set I'd appreciate an up vote, cheers.
Scraped data of used cars listings. 100,000 listings, which have been separated into files corresponding to each car manufacturer. I collected the data to make a tool to predict how much my friend should sell his old car for compared to other stuff on the market, and then just extended the data set. Then made a more general car value regression model.
Picked two fairly common cars on the British market for analysis (Ford Focus and Mercedes C Class). The hope is to find info such as: when is the ideal time to sell certain cars (i.e. at what age and mileage are there significant drops in resale value). Also can make comparisons between the two, and make a classifier for a ford or Mercedes car. Can easily add more makes and models, so comment for any request e.g. if you want a big data set of all Mercedes makes and models.
The cleaned data set contains information of price, transmission, mileage, fuel type, road tax, miles per gallon (mpg), and engine size. I've removed duplicate listings and cleaned the columns, but have included a notebook showing the process and the original data for anyone who wants to check/improve my work.
It'd be cool to have some insights and visualisations of the data. Also, am open to ideas on how to expand the data set.
CREATE TABLE audi (
"model" VARCHAR,
"year" BIGINT,
"price" BIGINT,
"transmission" VARCHAR,
"mileage" BIGINT,
"fueltype" VARCHAR,
"tax" BIGINT,
"mpg" DOUBLE,
"enginesize" DOUBLE
);CREATE TABLE bmw (
"model" VARCHAR,
"year" BIGINT,
"price" BIGINT,
"transmission" VARCHAR,
"mileage" BIGINT,
"fueltype" VARCHAR,
"tax" BIGINT,
"mpg" DOUBLE,
"enginesize" DOUBLE
);CREATE TABLE cclass (
"model" VARCHAR,
"year" BIGINT,
"price" BIGINT,
"transmission" VARCHAR,
"mileage" BIGINT,
"fueltype" VARCHAR,
"enginesize" DOUBLE
);CREATE TABLE focus (
"model" VARCHAR,
"year" BIGINT,
"price" BIGINT,
"transmission" VARCHAR,
"mileage" BIGINT,
"fueltype" VARCHAR,
"enginesize" DOUBLE
);CREATE TABLE ford (
"model" VARCHAR,
"year" BIGINT,
"price" BIGINT,
"transmission" VARCHAR,
"mileage" BIGINT,
"fueltype" VARCHAR,
"tax" BIGINT,
"mpg" DOUBLE,
"enginesize" DOUBLE
);CREATE TABLE hyundi (
"model" VARCHAR,
"year" BIGINT,
"price" BIGINT,
"transmission" VARCHAR,
"mileage" BIGINT,
"fueltype" VARCHAR,
"tax" BIGINT -- Tax(£),
"mpg" DOUBLE,
"enginesize" DOUBLE
);CREATE TABLE merc (
"model" VARCHAR,
"year" BIGINT,
"price" BIGINT,
"transmission" VARCHAR,
"mileage" BIGINT,
"fueltype" VARCHAR,
"tax" BIGINT,
"mpg" DOUBLE,
"enginesize" DOUBLE
);CREATE TABLE skoda (
"model" VARCHAR,
"year" BIGINT,
"price" BIGINT,
"transmission" VARCHAR,
"mileage" BIGINT,
"fueltype" VARCHAR,
"tax" BIGINT,
"mpg" DOUBLE,
"enginesize" DOUBLE
);CREATE TABLE toyota (
"model" VARCHAR,
"year" BIGINT,
"price" BIGINT,
"transmission" VARCHAR,
"mileage" BIGINT,
"fueltype" VARCHAR,
"tax" BIGINT,
"mpg" DOUBLE,
"enginesize" DOUBLE
);CREATE TABLE unclean_cclass (
"model" VARCHAR,
"year" DOUBLE,
"price" VARCHAR,
"transmission" VARCHAR,
"mileage" VARCHAR,
"fuel_type" VARCHAR,
"engine_size" VARCHAR,
"mileage2" VARCHAR,
"fuel_type2" VARCHAR,
"engine_size2" VARCHAR,
"reference" VARCHAR
);CREATE TABLE unclean_focus (
"model" VARCHAR,
"year" DOUBLE,
"price" VARCHAR,
"transmission" VARCHAR,
"mileage" VARCHAR,
"fuel_type" VARCHAR,
"engine_size" VARCHAR,
"mileage2" DOUBLE,
"fuel_type2" VARCHAR,
"engine_size2" VARCHAR,
"reference" VARCHAR
);CREATE TABLE vauxhall (
"model" VARCHAR,
"year" BIGINT,
"price" BIGINT,
"transmission" VARCHAR,
"mileage" BIGINT,
"fueltype" VARCHAR,
"tax" BIGINT,
"mpg" DOUBLE,
"enginesize" DOUBLE
);CREATE TABLE vw (
"model" VARCHAR,
"year" BIGINT,
"price" BIGINT,
"transmission" VARCHAR,
"mileage" BIGINT,
"fueltype" VARCHAR,
"tax" BIGINT,
"mpg" DOUBLE,
"enginesize" DOUBLE
);Anyone who has the link will be able to view this.