Baselight

ISA Literature Screening Dataset V.1

U.S. Environmental Protection Agency

@usgov.epa_gov_isa_literature_screening_dataset_v_1

Loading...
Loading...

About this Dataset

ISA Literature Screening Dataset V.1

This dataset contains one instance of the information used in searching and screening peer-reviewed literature to identify references to be used in EPA's Integrated Science Assessments (ISAs). It is comprised of six subsets, three for each of two ISAs for Ozone (2013 and 2020). The subsets are reference metadata, which contains all metadata for references found through searches and references cited in the respective ISA, Citation context, which contains the text of paragraph of the ISA where each reference was cited, and the semantic map, which provides the outline of the ISA in a hierarchical table of chapters, sections and subsections. See the Supporting documents for a more detailed description.
Organization: U.S. Environmental Protection Agency
Last updated: 2022-07-01T16:09:21.692893
Tags: data-management, information-science, machine-learning, methods-development, science-assessment

Tables

Citation Context 2013

@usgov.epa_gov_isa_literature_screening_dataset_v_1.citation_context_2013
  • 919.15 kB
  • 5,949 rows
  • 17 columns
Loading...
CREATE TABLE citation_context_2013 (
  "instance_id" BIGINT,
  "reference_id" BIGINT,
  "level_all_num" VARCHAR,
  "level1_num" BIGINT,
  "level2_num" BIGINT,
  "level3_num" VARCHAR,
  "level4_num" VARCHAR,
  "level5_num" VARCHAR,
  "level6_num" VARCHAR,
  "level1_char" VARCHAR,
  "level2_char" VARCHAR,
  "level3_char" VARCHAR,
  "level4_char" VARCHAR,
  "level5_char" VARCHAR,
  "level6_char" VARCHAR,
  "anchor_text" VARCHAR,
  "context_paragraph" VARCHAR
);

Citation Context 2020

@usgov.epa_gov_isa_literature_screening_dataset_v_1.citation_context_2020
  • 426.89 kB
  • 3,887 rows
  • 17 columns
Loading...
CREATE TABLE citation_context_2020 (
  "instance_id" BIGINT,
  "reference_id" BIGINT,
  "level_all_num" VARCHAR,
  "level1_num" BIGINT,
  "level2_num" BIGINT,
  "level3_num" VARCHAR,
  "level4_num" VARCHAR,
  "level5_num" VARCHAR,
  "level6_num" VARCHAR,
  "level1_char" VARCHAR,
  "level2_char" VARCHAR,
  "level3_char" VARCHAR,
  "level4_char" VARCHAR,
  "level5_char" VARCHAR,
  "level6_char" VARCHAR,
  "anchor_text" VARCHAR,
  "context_paragraph" VARCHAR
);

Reference Metadata 2020

@usgov.epa_gov_isa_literature_screening_dataset_v_1.reference_metadata_2020
  • 151.22 MB
  • 171,572 rows
  • 10 columns
Loading...
CREATE TABLE reference_metadata_2020 (
  "reference_id" BIGINT,
  "pmid" DOUBLE,
  "wosid" VARCHAR,
  "doi" VARCHAR,
  "title" VARCHAR,
  "authors" VARCHAR,
  "year" DOUBLE,
  "abstract" VARCHAR,
  "cited" VARCHAR,
  "in_search" VARCHAR
);

Semantic Map 2013

@usgov.epa_gov_isa_literature_screening_dataset_v_1.semantic_map_2013
  • 24.52 kB
  • 385 rows
  • 17 columns
Loading...
CREATE TABLE semantic_map_2013 (
  "isa_year" BIGINT,
  "level_all_num" VARCHAR,
  "level1_num" BIGINT,
  "level2_num" VARCHAR,
  "level3_num" VARCHAR,
  "level4_num" VARCHAR,
  "level5_num" VARCHAR,
  "level6_num" VARCHAR,
  "level1_char" VARCHAR,
  "level2_char" VARCHAR,
  "level3_char" VARCHAR,
  "level4_char" VARCHAR,
  "level5_char" VARCHAR,
  "level6_char" VARCHAR,
  "chapter_category" VARCHAR,
  "topic" VARCHAR,
  "discipline" VARCHAR
);

Semantic Map 2020

@usgov.epa_gov_isa_literature_screening_dataset_v_1.semantic_map_2020
  • 27.98 kB
  • 605 rows
  • 17 columns
Loading...
CREATE TABLE semantic_map_2020 (
  "isa_year" BIGINT,
  "level_all_num" VARCHAR,
  "level1_num" BIGINT,
  "level2_num" VARCHAR,
  "level3_num" VARCHAR,
  "level4_num" VARCHAR,
  "level5_num" VARCHAR,
  "level6_num" VARCHAR,
  "level1_char" VARCHAR,
  "level2_char" VARCHAR,
  "level3_char" VARCHAR,
  "level4_char" VARCHAR,
  "level5_char" VARCHAR,
  "level6_char" VARCHAR,
  "chapter_category" VARCHAR,
  "topic" VARCHAR,
  "discipline" VARCHAR
);

Share link

Anyone who has the link will be able to view this.