Baselight

Dataset: Predictions Of Cyanobacteria And Microcystin In Lakes Across The Conterminous United States

U.S. Environmental Protection Agency

@usgov.epa_gov_dataset_predictions_of_cyanobacteria_and_micro_12dc6412

Loading...
Loading...

About this Dataset

Dataset: Predictions Of Cyanobacteria And Microcystin In Lakes Across The Conterminous United States

With increasing concerns about freshwater cyanobacteria blooms, there is a need to identify which waterbodies are at risk for developing these blooms, especially those that produce cyanotoxins. To address this concern, we developed spatial statistical models using the US National Lakes Assessment, a survey with over 3,000 spring and summer observations of cyanobacteria abundance and microcystin concentration in lakes across the conterminous US. We combined these observations with other nationally available data to model which lake and watershed factors best explain the presence of harmful cyanobacterial blooms. We then used these models to estimate the cyanobacteria abundance and probability of microcystin detection in 124,500 lakes across the CONUS. This dataset includes the compiled data used to generate the models and the dataset used to generate prediction for a much larger population of lakes. The data package includes two tabular data files, two tabular metadata files, and one methods document.
Organization: U.S. Environmental Protection Agency
Last updated: 2025-06-27T12:21:26.953961
Tags: cyanobacteria, cyanotoxins, harmful-algae-blooms, lakes

Tables

HABsDrivers Model Data

@usgov.epa_gov_dataset_predictions_of_cyanobacteria_and_micro_12dc6412.habsdrivers_model_data
  • 765.31 KB
  • 3664 rows
  • 72 columns
Loading...

CREATE TABLE habsdrivers_model_data (
  "site_id" VARCHAR,
  "visit_no" BIGINT,
  "unique_id" VARCHAR,
  "dsgn_cycle" BIGINT,
  "date_col" TIMESTAMP,
  "lat_dd83" DOUBLE,
  "lon_dd83" DOUBLE,
  "comid" BIGINT,
  "temperature" VARCHAR,
  "maxdepth" VARCHAR,
  "stratified" BIGINT,
  "ammonia_n" VARCHAR,
  "do_surf" VARCHAR,
  "doc" VARCHAR,
  "ntl" DOUBLE,
  "ptl" VARCHAR,
  "turb" VARCHAR,
  "nitrate_n" VARCHAR,
  "ph" DOUBLE,
  "chla_result" VARCHAR,
  "micx" VARCHAR,
  "micx_det" VARCHAR,
  "b_g_dens" VARCHAR,
  "evap_infl" VARCHAR,
  "d_excess" VARCHAR,
  "agr_ws" VARCHAR,
  "dev_ws" VARCHAR,
  "fst_ws" VARCHAR,
  "wet_ws" VARCHAR,
  "precip_mean_month" VARCHAR,
  "temp_mean_month" VARCHAR,
  "lakemorpho_fetch" VARCHAR,
  "bfiws" VARCHAR,
  "agkffactws" VARCHAR,
  "kffactws" VARCHAR,
  "runoffws" VARCHAR,
  "precip_minus_evtws" VARCHAR,
  "precip8110ws" VARCHAR,
  "tmean8110ws" VARCHAR,
  "elevws" VARCHAR,
  "slopews" VARCHAR,
  "n_cbnf" VARCHAR,
  "n_crop_n_rem" VARCHAR,
  "n_fert_farm" VARCHAR,
  "n_fert_urban" VARCHAR,
  "n_forest_fire" VARCHAR,
  "n_human_n_demand" VARCHAR,
  "n_human_waste" VARCHAR,
  "n_manure_recov" VARCHAR,
  "n_rock" VARCHAR,
  "n_surplus" VARCHAR,
  "n_total_deposition" VARCHAR,
  "n_total_inputs" VARCHAR,
  "n_total_outputs" VARCHAR,
  "n_total_nbnf" VARCHAR,
  "n_livestock_waste" VARCHAR,
  "p_accumulated_ag_inputs" VARCHAR,
  "p_crop_removal" VARCHAR,
  "p_deposition" VARCHAR,
  "p_legacy_p" VARCHAR,
  "p_recovered_p" VARCHAR,
  "p_surplus" VARCHAR,
  "p_f_fertilizer" VARCHAR,
  "p_human_waste_kg" VARCHAR,
  "p_livestock_waste" VARCHAR,
  "p_nf_fertilizer" VARCHAR,
  "n_farm_inputs" VARCHAR,
  "n_dev_inputs" VARCHAR,
  "p_farm_inputs" VARCHAR,
  "p_dev_inputs" VARCHAR,
  "ag_eco3" VARCHAR,
  "ag_eco9_nm" VARCHAR
);

HABsDrivers Model Metadata

@usgov.epa_gov_dataset_predictions_of_cyanobacteria_and_micro_12dc6412.habsdrivers_model_metadata
  • 10.01 KB
  • 72 rows
  • 7 columns
Loading...

CREATE TABLE habsdrivers_model_metadata (
  "variable" VARCHAR,
  "description" VARCHAR,
  "type" VARCHAR,
  "values" VARCHAR,
  "units" VARCHAR,
  "missingdatavalue" VARCHAR,
  "missingdatameaning" VARCHAR
);

HABsDrivers Pred Data

@usgov.epa_gov_dataset_predictions_of_cyanobacteria_and_micro_12dc6412.habsdrivers_pred_data
  • 8.75 MB
  • 124529 rows
  • 28 columns
Loading...

CREATE TABLE habsdrivers_pred_data (
  "comid" BIGINT,
  "dsgn_cycle" BIGINT,
  "unique_id" VARCHAR,
  "n_dev_inputs" DOUBLE,
  "n_farm_inputs" DOUBLE,
  "p_dev_inputs" DOUBLE,
  "p_farm_inputs" DOUBLE,
  "bfiws" DOUBLE,
  "maxdepth" DOUBLE,
  "lakemorpho_fetch" BIGINT,
  "precip8110ws" DOUBLE,
  "tmean8110ws" DOUBLE,
  "fst_ws" DOUBLE,
  "ag_eco3" VARCHAR,
  "micx_prob_fit" DOUBLE,
  "micx_prob_lwr" DOUBLE,
  "micx_prob_upr" DOUBLE,
  "micx_logit_fit" DOUBLE,
  "micx_logit_lwr" BIGINT,
  "micx_logit_upr" DOUBLE,
  "cyano_log_fit" DOUBLE,
  "cyano_log_lwr" DOUBLE,
  "cyano_log_upr" DOUBLE,
  "cyano_abun_fit" DOUBLE,
  "cyano_abun_lwr" DOUBLE,
  "cyano_abun_upr" DOUBLE,
  "latitude_epsg5072" DOUBLE,
  "logitude_epsg5072" DOUBLE
);

HABsDrivers Pred Metadata

@usgov.epa_gov_dataset_predictions_of_cyanobacteria_and_micro_12dc6412.habsdrivers_pred_metadata
  • 8.38 KB
  • 28 rows
  • 8 columns
Loading...

CREATE TABLE habsdrivers_pred_metadata (
  "variable" VARCHAR,
  "description" VARCHAR,
  "type" VARCHAR,
  "values" VARCHAR,
  "units" VARCHAR,
  "source" VARCHAR,
  "missingdatavalue" VARCHAR,
  "missingdatameaning" VARCHAR
);

Share link

Anyone who has the link will be able to view this.