Baselight

SDNist V1.3: Temporal Map Challenge Environment

National Institute of Standards and Technology

@usgov.national_institute_of_standard_sdnist_benchmark_and_ev_12cb4da6

About this Dataset

SDNist (v1.3) is a set of benchmark data and metrics for evaluating synthetic data generators on structured tabular data. This version (1.3) reproduces the challenge environment from Sprints 2 and 3 of the Temporal Map Challenge. The benchmarks are distributed as a simple open-source Python package to allow standardized, reproducible comparison of synthetic generator models on real-world data and use cases. The data and metrics were developed for and vetted through the NIST PSCR Differential Privacy Temporal Map Challenge, where the evaluation tools, k-marginal and Higher Order Conjunction, proved effective in distinguishing competing models in the competition environment.

SDNist can be installed with pip (pip install sdnist==1.2.8, for Python >=3.6) or obtained from USNIST on GitHub. The sdnist Python module downloads data from NIST as necessary, so users do not need to download data manually.
Organization: National Institute of Standards and Technology
Last updated: 2022-01-24T19:36:28.148643
Tags: benchmarks, differential-privacy, privacy, private-information-sharing, synthetic-data
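
The description names k-marginal as one of the evaluation tools without spelling it out. As a rough illustration only, the sketch below compares the joint distribution of every k-column subset in a real and a synthetic table and averages the total absolute difference in density. It is not the sdnist implementation: the function names are hypothetical, the toy rows are made up, and steps a production scorer may apply (for example binning numeric columns or sampling column subsets) are left out.

```python
from collections import Counter
from itertools import combinations


def marginal_density(rows, columns):
    """Return the share of rows in each combination of values for `columns`."""
    counts = Counter(tuple(row[c] for c in columns) for row in rows)
    total = len(rows)
    return {key: n / total for key, n in counts.items()}


def k_marginal_distance(real_rows, synth_rows, columns, k=2):
    """Mean L1 distance between k-way marginals (0 = identical, 2 = disjoint)."""
    distances = []
    for subset in combinations(columns, k):
        p = marginal_density(real_rows, subset)
        q = marginal_density(synth_rows, subset)
        cells = set(p) | set(q)
        distances.append(sum(abs(p.get(c, 0.0) - q.get(c, 0.0)) for c in cells))
    return sum(distances) / len(distances)


# Toy rows (made up) that reuse three column names from Table 1.
real = [
    {"sex": 1, "marst": 1, "educ": 6},
    {"sex": 2, "marst": 6, "educ": 10},
    {"sex": 2, "marst": 1, "educ": 6},
    {"sex": 1, "marst": 6, "educ": 10},
]
synthetic = [
    {"sex": 1, "marst": 1, "educ": 6},
    {"sex": 2, "marst": 6, "educ": 10},
    {"sex": 2, "marst": 6, "educ": 10},
    {"sex": 1, "marst": 1, "educ": 6},
]

distance = k_marginal_distance(real, synthetic, ["sex", "marst", "educ"], k=2)
print(f"mean 2-marginal distance: {distance:.3f}")
```

A lower distance means the synthetic table preserves the k-way relationships of the real table more closely. Comparing a table with itself gives 0.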

Tables

Table 1

@usgov.national_institute_of_standard_sdnist_benchmark_and_ev_12cb4da6.table_1
  • 33.1 MB
  • 1348364 rows
  • 37 columns

CREATE TABLE table_1 (
  "unnamed_0" BIGINT,
  "puma" VARCHAR,
  "year" BIGINT,
  "hhwt" DOUBLE,
  "gq" BIGINT,
  "perwt" DOUBLE,
  "sex" BIGINT,
  "age" BIGINT,
  "marst" BIGINT,
  "race" BIGINT,
  "hispan" BIGINT,
  "citizen" BIGINT,
  "speakeng" BIGINT,
  "hcovany" BIGINT,
  "hcovpriv" BIGINT,
  "hinsemp" BIGINT,
  "hinscaid" BIGINT,
  "hinscare" BIGINT,
  "educ" BIGINT,
  "empstat" BIGINT,
  "empstatd" BIGINT,
  "labforce" BIGINT,
  "wrklstwk" BIGINT,
  "absent" BIGINT,
  "looking" BIGINT,
  "availble" BIGINT,
  "wrkrecal" BIGINT,
  "workedyr" BIGINT,
  "inctot" BIGINT,
  "incwage" BIGINT,
  "incwelfr" BIGINT,
  "incinvst" BIGINT,
  "incearn" BIGINT,
  "poverty" BIGINT,
  "departs" BIGINT,
  "arrives" BIGINT,
  "sim_individual_id" BIGINT
);
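
To show how the Table 1 schema might be queried, the sketch below builds an in-memory SQLite table with a handful of the columns above and aggregates records per PUMA and year. The rows and PUMA labels are invented placeholders, and treating perwt as a person weight to be summed is an assumption based on the column name, not something this page states.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Subset of the Table 1 columns. SQLite maps VARCHAR/BIGINT/DOUBLE
# onto its own TEXT/INTEGER/REAL type affinities.
conn.execute(
    'CREATE TABLE table_1 ("puma" VARCHAR, "year" BIGINT, "perwt" DOUBLE, '
    '"age" BIGINT, "sim_individual_id" BIGINT)'
)

# Made-up placeholder rows: (puma, year, perwt, age, sim_individual_id).
rows = [
    ("puma-a", 2012, 20.0, 34, 1),
    ("puma-a", 2012, 15.0, 61, 2),
    ("puma-a", 2013, 22.0, 35, 1),
    ("puma-b", 2012, 30.0, 47, 3),
]
conn.executemany("INSERT INTO table_1 VALUES (?, ?, ?, ?, ?)", rows)

# One output row per (PUMA, year) cell: record count and summed weight.
query = """
    SELECT puma, year, COUNT(*) AS records, SUM(perwt) AS weighted_persons
    FROM table_1
    GROUP BY puma, year
    ORDER BY puma, year
"""
summary = conn.execute(query).fetchall()
for row in summary:
    print(row)
```

Grouping by puma and year gives one cell per place and period, which is a natural unit for comparing a real table against a synthetic one.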

Table 2

@usgov.national_institute_of_standard_sdnist_benchmark_and_ev_12cb4da6.table_2
  • 343.66 MB
  • 26035898 rows
  • 14 columns

CREATE TABLE table_2 (
  "unnamed_0" BIGINT,
  "taxi_id" BIGINT,
  "shift" BIGINT,
  "company_id" BIGINT,
  "pickup_community_area" BIGINT,
  "dropoff_community_area" BIGINT,
  "payment_type" BIGINT,
  "trip_day_of_week" BIGINT,
  "trip_hour_of_day" BIGINT,
  "fare" BIGINT,
  "tips" BIGINT,
  "trip_total" BIGINT,
  "trip_seconds" BIGINT,
  "trip_miles" BIGINT
);
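
A similar sketch for Table 2 counts trips per pickup community area and shift, and per taxi. Again the rows are invented for illustration, only some of the columns are included, and the units of fare and the meaning of the shift codes are not documented on this page.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Subset of the Table 2 columns.
conn.execute(
    'CREATE TABLE table_2 ("taxi_id" BIGINT, "shift" BIGINT, '
    '"pickup_community_area" BIGINT, "dropoff_community_area" BIGINT, '
    '"fare" BIGINT, "trip_total" BIGINT)'
)

# Made-up rows: (taxi_id, shift, pickup, dropoff, fare, trip_total).
trips = [
    (101, 3, 8, 32, 12, 15),
    (101, 3, 8, 8, 7, 8),
    (101, 4, 32, 8, 10, 12),
    (202, 3, 8, 32, 14, 14),
]
conn.executemany("INSERT INTO table_2 VALUES (?, ?, ?, ?, ?, ?)", trips)

# Trip count and mean fare for each (pickup area, shift) cell.
by_area_shift = conn.execute(
    """
    SELECT pickup_community_area, shift, COUNT(*) AS trips, AVG(fare) AS mean_fare
    FROM table_2
    GROUP BY pickup_community_area, shift
    ORDER BY pickup_community_area, shift
    """
).fetchall()

# Number of trip records contributed by each taxi.
trips_per_taxi = conn.execute(
    "SELECT taxi_id, COUNT(*) FROM table_2 GROUP BY taxi_id ORDER BY taxi_id"
).fetchall()

print(by_area_shift)
print(trips_per_taxi)
```

Unlike Table 1, a single taxi_id can account for many rows here, so the per-taxi count is worth checking before treating rows as independent records.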
