Baselight

COVID-19 Visualisation And Epidemic Analysis Data

Includes demographics, daily new, remaining and cumulative cases

@kaggle.dylansp_covid19_country_level_data_for_epidemic_model

Loading...
Loading...

About this Dataset

COVID-19 Visualisation And Epidemic Analysis Data

COVID-19 Dataset for Epidemic Model Development

I combined several data sources to gain an integrated dataset involving country-level COVID-19 confirmed, recovered and fatalities cases which can be used to build some epidemic models such as SIR, SIR with mortality. Adding information regarding population which can be used for calculating incidence rate and prevalence rate.
One of my applications based on this dataset is published at https://dylansp.shinyapps.io/COVID19_Visualization_Analysis_Tool/.

Content

My approach is to retrieve cumulative confirmed cases, fatalities and recovered cases since 2020-01-22 onwards from the Johns Hopkins University Center for Systems Science and Engineering (JHU CSSE) COVID-19 dataset, merged with country code as well as population of each country. For the purpose of building epidemic models, I calculated information regarding daily new confirmed cases, recovered cases, and fatalities, together with remaining confirmed cases which equal to cumulative confirmed cases - cumulative recovered cases - cumulative fatalities. I haven't yet to find creditable data sources regarding probable cases of various countries yet. I'll add them once I found them.

  • Date: The date of the record.
  • Country_Region: The name of the country/region.
    -alpha-3_code: country code for that can be used for map visualization.
  • Population: The population of the given country/region.
  • Total_Confirmed_Cases: Cumulative confirmed cases.
  • Total_Fatalities: Cumulative fatalities.
  • Total_Recovered_Cases: Cumulative recovered cases.
  • New_Confirmed_Cases: Daily new confirmed cases.
  • New_Fatalities: Daily new fatalities.
  • New_Recovered_Cases: Daily new recovered cases.
  • Remaining_Confirmed_Cases: Remaining infected cases which equal to (cumulative confirmed cases - cumulative recovered cases - cumulative fatalities).

Acknowledgements

  1. The data source of confirmed cases, recovered cases and deaths is JHU CSSE https://github.com/CSSEGISandData/COVID-19;
  2. The data source of the country-level population mainly comes from https://storage.guidotti.dev/covid19/data/ and Worldometer (https://www.worldometers.info/population/).

Inspiration

  1. Building up the country-level COVID-19 case track dashboard.
  2. Insights regarding the incidence rate, prevalence rate, mortality and recovery rate of various countries.
  3. Building up epidemic models for forecasting.

Tables

Country Level Info Kaggle

@kaggle.dylansp_covid19_country_level_data_for_epidemic_model.country_level_info_kaggle
  • 1.1 MB
  • 70464 rows
  • 11 columns
Loading...

CREATE TABLE country_level_info_kaggle (
  "date" TIMESTAMP,
  "country_region" VARCHAR,
  "alpha_3_code" VARCHAR,
  "population" BIGINT,
  "total_confirmed_cases" BIGINT,
  "total_fatalities" BIGINT,
  "total_recovered_cases" BIGINT,
  "new_confirmed_cases" BIGINT,
  "new_fatalities" BIGINT,
  "new_recovered_cases" BIGINT,
  "remaining_confirmed_cases" BIGINT
);

Share link

Anyone who has the link will be able to view this.