Billionaires Dataset Cleaned
ready for Exploratory Data Analysis and Modeling
@kaggle.javiersab_billionaires_dataset_cleaned
ready for Exploratory Data Analysis and Modeling
@kaggle.javiersab_billionaires_dataset_cleaned
Cleaned dataset from the Billionaires Statistic Dataset (2023) that can be found here.
The code I used to clean and re-structure the data is also here.
First things first: a big shout-out to Nidula Elgiriyewithana for providing the original data.
As with it, this dataset contains various information about the world's wealthiest persons in different columns that can be grouped into three different types:
If you want a challenge, you can create a dashboard using tools such as Plotly to dynamically visualize the data using one or different attributes (such as industry, age or country). I did it, leave the link below in case you want to investigate:
Dashboard notebook here
CREATE TABLE df_ready (
  "position" BIGINT,
  "wealth" BIGINT,
  "industry" VARCHAR,
  "full_name" VARCHAR,
  "age" BIGINT,
  "country_of_residence" VARCHAR,
  "city_of_residence" VARCHAR,
  "source" VARCHAR,
  "citizenship" VARCHAR,
  "gender" VARCHAR,
  "birth_date" TIMESTAMP,
  "last_name" VARCHAR,
  "first_name" VARCHAR,
  "residence_state" VARCHAR,
  "residence_region" VARCHAR,
  "birth_year" BIGINT,
  "birth_month" BIGINT,
  "birth_day" BIGINT,
  "cpi_country" VARCHAR,
  "cpi_change_country" VARCHAR,
  "gdp_country" BIGINT,
  "g_tertiary_ed_enroll" DOUBLE,
  "g_primary_ed_enroll" DOUBLE,
  "life_expectancy" DOUBLE,
  "tax_revenue" DOUBLE,
  "tax_rate" DOUBLE,
  "country_pop" BIGINT,
  "country_lat" DOUBLE,
  "country_long" DOUBLE,
  "continent" VARCHAR
);Anyone who has the link will be able to view this.