COVID-19 Numerical Claims Open Research Dataset
Numerical claims related to COVID-19
@kaggle.dshah1612_covid19_numerical_claims_open_research_dataset
The COVID-19 Numerical Claims Open Research Dataset (CONCORD) is a comprehensive, open-source dataset of numerical claims extracted from academic papers on COVID-19-related research. CONCORD contains approximately 203k numerical claims pertinent to COVID-19, extracted from more than 57,000 scientific research articles published between January 2020 and May 2022. The claims were extracted from full-text research articles annotated using a white-box, weakly supervised model. We used the CORD-19 repository as the raw dataset for our research work.
Why numerical claims?
Thumbnail Image source: https://indianexpress.com/article/cities/bangalore/unsustainable-urbanisation-coronavirus-variants-8062078/
CREATE TABLE concord (
"claim_uid" VARCHAR,
"cord_uid" VARCHAR,
"title" VARCHAR,
"doi" VARCHAR,
"numerical_claims" VARCHAR,
"publish_time" TIMESTAMP,
"authors" VARCHAR,
"journal" VARCHAR,
"country" VARCHAR,
"institution" VARCHAR
);
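As a rough illustration of how the schema above might be queried, the sketch below loads it into an in-memory SQLite database and counts claims per source paper. It assumes that each row holds one claim (keyed by claim_uid) and that cord_uid identifies the CORD-19 paper the claim came from; the column names suggest this, but the page does not state it. The rows are invented placeholders rather than real CONCORD records, and the helper names build_demo_connection and claims_per_paper are hypothetical.

```python
import sqlite3

# Schema copied from the table definition above (SQLite accepts these type names).
SCHEMA = """
CREATE TABLE concord (
    "claim_uid" VARCHAR,
    "cord_uid" VARCHAR,
    "title" VARCHAR,
    "doi" VARCHAR,
    "numerical_claims" VARCHAR,
    "publish_time" TIMESTAMP,
    "authors" VARCHAR,
    "journal" VARCHAR,
    "country" VARCHAR,
    "institution" VARCHAR
)
"""

# Placeholder rows invented purely for illustration; NOT real CONCORD data.
PLACEHOLDER_ROWS = [
    ("claim-1", "paper-a", "Example title A", "doi-a", "placeholder claim 1",
     "2020-03-01 00:00:00", "Author One", "Journal X", "Country P", "Institution M"),
    ("claim-2", "paper-a", "Example title A", "doi-a", "placeholder claim 2",
     "2020-03-01 00:00:00", "Author One", "Journal X", "Country P", "Institution M"),
    ("claim-3", "paper-b", "Example title B", "doi-b", "placeholder claim 3",
     "2021-07-15 00:00:00", "Author Two", "Journal Y", "Country Q", "Institution N"),
]


def build_demo_connection():
    """Create an in-memory database holding the schema and the placeholder rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute(SCHEMA)
    conn.executemany(
        "INSERT INTO concord VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?)",
        PLACEHOLDER_ROWS,
    )
    return conn


def claims_per_paper(conn):
    """Return (cord_uid, claim_count) pairs, papers with the most claims first."""
    query = """
        SELECT cord_uid, COUNT(*) AS claim_count
        FROM concord
        GROUP BY cord_uid
        ORDER BY claim_count DESC, cord_uid ASC
    """
    return conn.execute(query).fetchall()


if __name__ == "__main__":
    demo = build_demo_connection()
    for cord_uid, claim_count in claims_per_paper(demo):
        print(cord_uid, claim_count)
```

The same GROUP BY pattern would apply to other columns in the schema, such as country or journal, if those fields are populated.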