This is a daily updating version of COVID-19 Data Repository by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University (JHU). The data updates every day at 6am UTC, which updates just after the raw JHU data typically updates.
I'm making it available in both a raw form (files with the prefix RAW) and convenient form (files prefixed with CONVENIENT).
The data covers:
- confirmed cases and deaths on a country level
- confirmed cases and deaths by US county
- some metadata that's available in the raw JHU data
The RAW version is exactly as it's distributed in the original dataset.
The CONVENIENT version is aiming to be easier to analyze. The data is organized by column rather than by row. The metadata is stripped out into a separate file. And it converted to daily change rather than cumulative totals.
If you find any issues in the data, then you can share them in this discussion thread. I will attempt to address the most upvoted issues.
If you have any requests for changing or enriching this data, please add them on this discussion thread. Again, I will attempt to address the most upvoted requests.
I have a notebook that updates just after each data dump updates, giving a brief overview of the latest data. It's also a useful reference if you want to see how to read the CONVENIENT data into a pandas DataFrame.