COVID PPP Loan Data With Fraud Examples
Full data for PPP loans over USD 150k with examples of loan fraud
@kaggle.danb91_covid_ppp_loan_data_with_fraud_examples
Full data for PPP loans over USD 150k with examples of loan fraud
@kaggle.danb91_covid_ppp_loan_data_with_fraud_examples
In 2020 the US Government created the Paycheck Protection Program (PPP) where small and mid-sized businesses could apply for forgivable loans at one percent interest in order to keep employees on the payroll and the business running during pandemic lockdowns. However, it is well established that these programs became targets for fraud as the government was attempting to get the funding to business quickly to mitigate layoffs.
The US Government has since created a website and task force dedicated to program oversight and fraud detection. The three files in this data set are based on PPP oversight data along with research into publicly-available data about companies charged with PPP loan fraud. This is an initial publication and work is continuing on the data sets (along with planned analysis notebooks).
This data set contains three files:
All three data sets are created via this notebook.
CREATE TABLE ppp_data_dict (
"field_name" VARCHAR,
"field_description" VARCHAR
);CREATE TABLE ppp_fraud_cases (
"company" VARCHAR,
"state" VARCHAR,
"source" VARCHAR,
"suspects" VARCHAR
);CREATE TABLE ppp_over_150k (
"loannumber" BIGINT,
"dateapproved" TIMESTAMP,
"sbaofficecode" BIGINT,
"processingmethod" VARCHAR,
"borrowername" VARCHAR,
"borroweraddress" VARCHAR,
"borrowercity" VARCHAR,
"borrowerstate" VARCHAR,
"borrowerzip" VARCHAR,
"loanstatusdate" TIMESTAMP,
"loanstatus" VARCHAR,
"term" BIGINT,
"sbaguarantypercentage" BIGINT,
"initialapprovalamount" DOUBLE,
"currentapprovalamount" DOUBLE,
"undisbursedamount" DOUBLE,
"franchisename" VARCHAR,
"servicinglenderlocationid" BIGINT,
"servicinglendername" VARCHAR,
"servicinglenderaddress" VARCHAR,
"servicinglendercity" VARCHAR,
"servicinglenderstate" VARCHAR,
"servicinglenderzip" VARCHAR,
"ruralurbanindicator" VARCHAR,
"hubzoneindicator" VARCHAR,
"lmiindicator" VARCHAR,
"businessagedescription" VARCHAR,
"projectcity" VARCHAR,
"projectcountyname" VARCHAR,
"projectstate" VARCHAR,
"projectzip" VARCHAR,
"cd" VARCHAR,
"jobsreported" DOUBLE,
"naicscode" BIGINT,
"race" VARCHAR,
"ethnicity" VARCHAR,
"utilities_proceed" DOUBLE,
"payroll_proceed" DOUBLE,
"mortgage_interest_proceed" DOUBLE,
"rent_proceed" DOUBLE,
"refinance_eidl_proceed" DOUBLE,
"health_care_proceed" DOUBLE,
"debt_interest_proceed" DOUBLE,
"businesstype" VARCHAR,
"originatinglenderlocationid" BIGINT,
"originatinglender" VARCHAR,
"originatinglendercity" VARCHAR,
"originatinglenderstate" VARCHAR,
"gender" VARCHAR,
"veteran" VARCHAR,
"nonprofit" VARCHAR,
"forgivenessamount" DOUBLE,
"forgivenessdate" TIMESTAMP,
"approvaldiff" DOUBLE,
"notforgivenamount" DOUBLE,
"forgivenpercentage" DOUBLE,
"total_proceed" DOUBLE,
"proceed_diff" DOUBLE,
"utilities_proceed_pct" DOUBLE,
"payroll_proceed_pct" DOUBLE,
"mortgage_interest_proceed_pct" DOUBLE,
"rent_proceed_pct" DOUBLE,
"refinance_eidl_proceed_pct" DOUBLE,
"health_care_proceed_pct" DOUBLE,
"debt_interest_proceed_pct" DOUBLE,
"proceed_per_job" DOUBLE
);Anyone who has the link will be able to view this.