Key Corona COVID-19 data sources with API
These are the main sources of coronavirus data available for direct download. Unfortunately, they are not published as open data under an open data license. If you want to stay informed without dealing with bureaucracy, feel free to download them.
#1 Johns Hopkins University
Name: Daily Cases, Deaths, Recovered
Source: Johns Hopkins University
Retrieved through: https://github.com/CSSEGISandData/COVID-19/blob/master/csse_covid_19_data/csse_covid_19_daily_reports/03-16-2020.csv
More Info about dataset:
https://coronadatasource.org/open-data/corona-data/corona-time-series-dataset-by-jhsu-csse/
Quality of Data:
- Coverage: Global
- Update mechanism: Once daily
- Data type: csv
- Way data is provided: Aggregation across country sources
- Notes: https://github.com/CSSEGISandData/COVID-19/tree/master/csse_covid_19_data
Johns Hopkins Data Format
Province/State // Country/Region // Last Update // Confirmed // Deaths // Recovered // Latitude // Longitude
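The column layout above can be read with a few lines of Python. This is a minimal sketch, not the project's official tooling: it assumes the daily reports are comma-separated with exactly the headers listed above, and that the raw-file counterpart of the GitHub link (RAW_BASE below) serves the same files. The sample rows are made-up placeholders, not real counts.

```python
import csv
import io
from collections import defaultdict
from datetime import date

# Assumption: raw-file counterpart of the GitHub "blob" link given above.
RAW_BASE = (
    "https://raw.githubusercontent.com/CSSEGISandData/COVID-19/master/"
    "csse_covid_19_data/csse_covid_19_daily_reports/"
)


def daily_report_url(day: date) -> str:
    """Build the URL of one daily report; files are named MM-DD-YYYY.csv."""
    return f"{RAW_BASE}{day.strftime('%m-%d-%Y')}.csv"


def confirmed_by_country(csv_text: str) -> dict:
    """Sum the Confirmed column over all provinces of each country."""
    totals = defaultdict(int)
    for row in csv.DictReader(io.StringIO(csv_text)):
        # Empty cells are treated as zero.
        totals[row["Country/Region"]] += int(row["Confirmed"] or 0)
    return dict(totals)


# Made-up placeholder rows that follow the documented column layout.
SAMPLE = """Province/State,Country/Region,Last Update,Confirmed,Deaths,Recovered,Latitude,Longitude
Region A,Country X,2020-03-16T00:00:00,10,1,2,0.0,0.0
Region B,Country X,2020-03-16T00:00:00,5,0,1,1.0,1.0
,Country Y,2020-03-16T00:00:00,7,0,0,2.0,2.0
"""

print(daily_report_url(date(2020, 3, 16)))
print(confirmed_by_country(SAMPLE))
```

To work with real data, download the file at the printed URL and pass its text to confirmed_by_country.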
#2 ECDC
Name: Cases, Deaths (per day)
Source: European Centre for Disease Prevention and Control (ECDC)
Retrieved through: https://www.ecdc.europa.eu/en/publications-data/download-todays-data-geographic-distribution-covid-19-cases-worldwide
Quality of Data:
- Coverage: Global
- Update mechanism: Once daily
- Way data is provided: Original source
- Notes: https://github.com/CSSEGISandData/COVID-19/tree/master/csse_covid_19_data
ECDC Data Format
DateRep // Day // Month // Year // Cases // Deaths // Countries and territories // GeoId
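Because this dataset reports cases and deaths per day, cumulative totals have to be computed by the reader. The sketch below shows one way to do that, assuming the download has been saved as CSV with exactly the headers listed above. The function name and the sample rows are illustrative placeholders, not real figures.

```python
import csv
import io
from datetime import date


def cumulative_cases(csv_text: str, geo_id: str) -> list:
    """Return (date, cumulative cases) pairs for one GeoId, oldest first."""
    daily = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        if row["GeoId"] != geo_id:
            continue
        # Build the date from the separate Day/Month/Year columns so the
        # result does not depend on how DateRep is formatted.
        day = date(int(row["Year"]), int(row["Month"]), int(row["Day"]))
        daily.append((day, int(row["Cases"])))

    running = 0
    series = []
    for day, cases in sorted(daily):
        running += cases
        series.append((day, running))
    return series


# Made-up placeholder rows that follow the documented column layout.
SAMPLE = """DateRep,Day,Month,Year,Cases,Deaths,Countries and territories,GeoId
16/03/2020,16,3,2020,4,1,Country X,XX
15/03/2020,15,3,2020,3,0,Country X,XX
16/03/2020,16,3,2020,2,0,Country Y,YY
"""

for day, total in cumulative_cases(SAMPLE, "XX"):
    print(day.isoformat(), total)
```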
#3 DXY China
Name: diagnosed, cumulative, death, cured by region
Source: NCOV DXY (Chinese online community for physicians)
Retrieved through: https://ncov.dxy.cn/ncovh5/view/pneumonia
Available Data (insert data table)
Quality of Data:
- Coverage: mainly China
- Update mechanism: tbd
- Way data is provided: Original source
#4 Tencent
Name: cumulative diagnosed, confirmed diagnoses, cumulative cured, cumulative deaths, existing severe cases, imported cases from overseas
Source: Tencent
Retrieved through: https://news.qq.com/zt2020/page/feiyan.htm#/
Quality of Data:
- Coverage: mainly China and selected countries
- Update mechanism: tbd
- Way data is provided: tbd
#5 WHO
Name: cumulative diagnosed, confirmed diagnoses, cumulative cured, cumulative deaths, existing severe cases, imported cases from overseas
Source: WHO
Available Data (insert data table)
Quality of Data:
- Coverage: Global
- Update mechanism: tbd
- Way data is provided: json
- Notes: Visual showcase https://experience.arcgis.com/experience/685d0ace521648f8a5beeeee1b9125cd
WHO Data Format
"name": "DateOfDataEntry", "type": "esriFieldTypeDate", "alias": "DateOfDataEntry", "sqlType": "sqlTypeOther", "length": 8, "domain": null, "defaultValue": null } ], "features": [ { "attributes": { "OBJECTID": 216, "NewCase": 85, "DateOfDataEntry": 1583971200000
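The DateOfDataEntry field is of type esriFieldTypeDate, which stores milliseconds since the Unix epoch, so the value has to be converted before it is readable. The following sketch does that conversion. It assumes the full response is a JSON object with a "features" list shaped like the fragment above; the sample payload contains only the one record visible in that fragment, and the function names are illustrative.

```python
import json
from datetime import datetime, timezone


def esri_date(ms: int):
    """Convert an Esri date (milliseconds since the Unix epoch) to a UTC date."""
    return datetime.fromtimestamp(ms / 1000, tz=timezone.utc).date()


def new_cases_by_day(payload: dict) -> list:
    """Extract (date, NewCase) pairs from the 'features' list."""
    pairs = []
    for feature in payload["features"]:
        attrs = feature["attributes"]
        pairs.append((esri_date(attrs["DateOfDataEntry"]), attrs["NewCase"]))
    return pairs


# The single record visible in the fragment above, wrapped in the assumed
# top-level structure.
SAMPLE = json.loads(
    '{"features": [{"attributes": {"OBJECTID": 216, "NewCase": 85, '
    '"DateOfDataEntry": 1583971200000}}]}'
)

for day, cases in new_cases_by_day(SAMPLE):
    print(day.isoformat(), cases)  # prints: 2020-03-12 85
```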
Starting list of additional resources for broader related datasets
#6 Hospital Beds (California)
Certified Healthcare Facility Bed Types and Counts
Name: Facility name, facility classification, bed capacity type, bed capacity, county
Source: California Health and Human Services https://www.chhs.ca.gov/
Retrieved through:
Available Data (insert data table)
Quality of Data:
- Coverage: California, USA
- Update mechanism: ?
- Way data is provided: csv / API
Notes:
<d:_id m:type="Edm.String">2</d:_id>
<d:FACID m:type="Edm.String">010000003</d:FACID>
<d:FACNAME m:type="Edm.String">CREEKSIDE REHABILITATION & BEHAVIORAL HEALTH</d:FACNAME>
<d:FAC_FDR m:type="Edm.String">SKILLED NURSING FACILITY</d:FAC_FDR>
<d:BED_CAPACITY_TYPE m:type="Edm.String">SPECIAL TREATMENT PROGRAM</d:BED_CAPACITY_TYPE>
<d:BED_CAPACITY m:type="Edm.Double">58</d:BED_CAPACITY>
<d:COUNTY_NAME m:type="Edm.String">SONOMA</d:COUNTY_NAME>
The Data API can be accessed via the following actions of the CKAN action API.
Create
https://data.chhs.ca.gov/api/3/action/datastore_create
Update / Insert
https://data.chhs.ca.gov/api/3/action/datastore_upsert
Query
https://data.chhs.ca.gov/api/3/action/datastore_search
Query (via SQL)
https://data.chhs.ca.gov/api/3/action/datastore_search_sql
OData
https://data.chhs.ca.gov/datastore/odata3.0/0997fa8e-ef7c-43f2-8b9a-94672935fa60
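The query endpoints listed above can be called with plain HTTP GET requests. The sketch below builds such requests using only the standard library. It rests on two assumptions that are not confirmed by the source: that the identifier at the end of the OData link is also the CKAN resource ID, and that the portal returns the standard CKAN response shape with records under result.records. The helper names are illustrative.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

BASE = "https://data.chhs.ca.gov/api/3/action/"
# Assumption: the ID at the end of the OData link is the CKAN resource ID.
RESOURCE_ID = "0997fa8e-ef7c-43f2-8b9a-94672935fa60"


def search_url(resource_id: str, limit: int = 5, filters: dict = None) -> str:
    """Build a datastore_search URL; filters is sent as a JSON object."""
    params = {"resource_id": resource_id, "limit": limit}
    if filters:
        params["filters"] = json.dumps(filters)
    return BASE + "datastore_search?" + urlencode(params)


def sql_url(sql: str) -> str:
    """Build a datastore_search_sql URL for a read-only SQL query."""
    return BASE + "datastore_search_sql?" + urlencode({"sql": sql})


def fetch_records(url: str) -> list:
    """Download a query result and return its list of records."""
    with urlopen(url) as response:
        body = json.load(response)
    return body["result"]["records"]


# Building URLs needs no network access; fetch_records performs the request.
print(search_url(RESOURCE_ID, limit=3, filters={"COUNTY_NAME": "SONOMA"}))
print(sql_url(f'SELECT * FROM "{RESOURCE_ID}" LIMIT 3'))
```

Passing either printed URL to fetch_records should return the matching rows as a list of dictionaries, provided the assumptions above hold for this portal.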
#7 Dataset of Corona Research
Name: CORD-19
Source: Semantic Scholar USA
Retrieved through: https://pages.semanticscholar.org/coronavirus-research
Available Data (insert data table): CORD-19 is a resource of over 29,000 scholarly articles, including over 13,000 with full text, about COVID-19, SARS-CoV-2, and related coronaviruses.
Quality of Data:
- Coverage: Global
- Update mechanism: ?
- Way data is provided: file download / API
Scrapers / automation tools / untrusted sites
Scrapers/Automation:
- Pipedream – Pravin (untrusted source) https://pipedream.com/@pravin/http-api-for-latest-wuhan-coronavirus-data-2019-ncov-p_G6CLVM/readme
Untrusted sites:
- https://github.com/NovelCOVID/API
- https://github.com/ExpDev07/coronavirus-tracker-api
- https://github.com/javieraviles/covidAPI
Ongoing projects
Related projects:
- Insights on Covid-19 data and hypothesis testing https://github.com/alexander-stage/athena
- Kaggle Challenge https://www.kaggle.com/allen-institute-for-ai/CORD-19-research-challenge
- Reddit — Coronavirus specific GPU projects are now available at folding@home. CPU projects coming ASAP. Join us in fighting against Covid-19! https://www.reddit.com/r/pcmasterrace/comments/fhb5e4/coronavirus_specific_gpu_projects_are_now/
- Help Covid Projects – https://helpwithcovid.com/projects
Original source of article:
https://www.linkedin.com/pulse/data-around-corona-key-datasets-apis-starting-point-new-umbach/
Project of original author of article: https://helpwithcovid.com/projects/36
Do you have any information about interesting open data sources or portals with downloadable open data?
Help us to improve Corona Data Source by sharing your data tips.