Baselight

Ethereum Effect Impact On PC Parts Prices

Data Warehouse for analysis of Ethereum Effect from 2013 to april 2018

@kaggle.raczeq_ethereum_effect_pc_parts

Loading...
Loading...

About this Dataset

Ethereum Effect Impact On PC Parts Prices

Project is not maintained anymore and probably will never be again. Whole python code used for data scrapping is pasted into a notebook attached to this project. I'm sorry for the inconvenience.

Context

Data warehouse has been created as a University project throughout 3 months (march - may 2018).

I haven't find any useful databases containing historical prices of many computer parts (CPUs, GPUs and RAMs), so I had to web scrap it from web prices comparison engines:

For webscrapping I've used Python with BeautifulSoup library as well as PCPartPicker-API.

Stuff that I'd like to add but I have no time to do it:

  • Compare RAM prices with raising demand for memory chips in smartphones industry and Data Centers for cloud computing
  • Scrap more price comparison engines
  • Add new dimension for DIM_PROD_GPU -> Series. I've made it too shallow and my analysis wasn't so great, additional step between Manufacturer and Product Name would make it easier for analytics.

Content

Database contains data about:

  • 15 most popular cryptocurrencies and their rates
  • 1664 CPU Products
  • 2054 GPU Products
  • 3706 RAM Products
  • 6 000 000+ records containing products historical prices

Acknowledgements

Few redditers who helped me to find some libraries and datasets to get inspired with.

My request post on r/datasets

Inspiration

As a gamer I've seen a huge prices spikes on a GPU market and as I haven't find too many analysis about this phenomenon and because it's very fresh I thought that might be a good topic to make a Uni project for a Data Warehouse course.


Photo Source

I release this data online so it won't waste. Maybe you will find some more interesting results that I did.

Installation

Uploaded csv files have been dumped from my data warehouse. Below you can see whole ERD diagram

Only difference: DIM_REGION contains one additional column that I didn't include in my data warehouse

Tables

Dim Cpu Prod

@kaggle.raczeq_ethereum_effect_pc_parts.dim_cpu_prod
  • 27.31 KB
  • 1664 rows
  • 6 columns
Loading...

CREATE TABLE dim_cpu_prod (
  "id" BIGINT,
  "manufacturer" VARCHAR,
  "series" VARCHAR,
  "cpu_name" VARCHAR,
  "cores" BIGINT,
  "socket" VARCHAR
);

Dim Crypto Data

@kaggle.raczeq_ethereum_effect_pc_parts.dim_crypto_data
  • 3.56 KB
  • 15 rows
  • 4 columns
Loading...

CREATE TABLE dim_crypto_data (
  "id" BIGINT,
  "code" VARCHAR,
  "currency_name" VARCHAR,
  "is_mineable" BIGINT
);

Dim Gpu Prod

@kaggle.raczeq_ethereum_effect_pc_parts.dim_gpu_prod
  • 21.8 KB
  • 2054 rows
  • 6 columns
Loading...

CREATE TABLE dim_gpu_prod (
  "id" BIGINT,
  "processor_manufacturer" VARCHAR,
  "processor" VARCHAR,
  "gpu_manufacturer" VARCHAR,
  "memory_capacity" DOUBLE,
  "memory_type" VARCHAR
);

Dim Merchant

@kaggle.raczeq_ethereum_effect_pc_parts.dim_merchant
  • 3.47 KB
  • 81 rows
  • 2 columns
Loading...

CREATE TABLE dim_merchant (
  "id" BIGINT,
  "merchant" VARCHAR
);

Dim Region

@kaggle.raczeq_ethereum_effect_pc_parts.dim_region
  • 2.67 KB
  • 11 rows
  • 3 columns
Loading...

CREATE TABLE dim_region (
  "id" BIGINT,
  "code" VARCHAR,
  "currency" VARCHAR
);

Dim Time

@kaggle.raczeq_ethereum_effect_pc_parts.dim_time
  • 14.79 KB
  • 1847 rows
  • 5 columns
Loading...

CREATE TABLE dim_time (
  "id" BIGINT,
  "year" BIGINT,
  "month" BIGINT,
  "day" BIGINT,
  "week" BIGINT
);

Fact Cpu Price

@kaggle.raczeq_ethereum_effect_pc_parts.fact_cpu_price
  • 19.67 MB
  • 1605191 rows
  • 6 columns
Loading...

CREATE TABLE fact_cpu_price (
  "prodid" BIGINT,
  "timeid" BIGINT,
  "regionid" BIGINT,
  "merchantid" BIGINT,
  "price_usd" DOUBLE,
  "price_original" DOUBLE
);

Fact Crypto Rate

@kaggle.raczeq_ethereum_effect_pc_parts.fact_crypto_rate
  • 403.29 KB
  • 15844 rows
  • 6 columns
Loading...

CREATE TABLE fact_crypto_rate (
  "codeid" BIGINT,
  "timeid" BIGINT,
  "open" DOUBLE,
  "close" DOUBLE,
  "high" DOUBLE,
  "low" DOUBLE
);

Fact Gpu Price

@kaggle.raczeq_ethereum_effect_pc_parts.fact_gpu_price
  • 12.33 MB
  • 997156 rows
  • 6 columns
Loading...

CREATE TABLE fact_gpu_price (
  "prodid" BIGINT,
  "timeid" BIGINT,
  "regionid" BIGINT,
  "merchantid" BIGINT,
  "price_usd" DOUBLE,
  "price_original" DOUBLE
);

Fact Ram Price

@kaggle.raczeq_ethereum_effect_pc_parts.fact_ram_price
  • 41.94 MB
  • 3412331 rows
  • 6 columns
Loading...

CREATE TABLE fact_ram_price (
  "prodid" BIGINT,
  "timeid" BIGINT,
  "regionid" BIGINT,
  "merchantid" BIGINT,
  "price_usd" DOUBLE,
  "price_original" DOUBLE
);

Share link

Anyone who has the link will be able to view this.