Easy Peasy: It's Lemon Squeezy
Predict Workers Compensation claims using highly realistic synthetic data
@kaggle.lucamassaron_easy_peasy_its_lemon_squeezy
Predict Workers Compensation claims using highly realistic synthetic data
@kaggle.lucamassaron_easy_peasy_its_lemon_squeezy
Workers' compensation is a form of insurance providing wage replacement and medical benefits to employees injured in the course of employment. In exchange for this coverage, the employee has to relinquish his right to sue his employer in the case of an incident. The system of collective liability was created to prevent employers from becoming insolvent as a result of high damage awards, and thus to ensure the security of compensation to the workers. Individual immunity is the necessary corollary to collective liability.
The data.csv file contains 54,000 insurance policies that you can use to train and validate your model.
Data fields
The data is fully synthetic and not specific to any legal jurisdiction or country. It has been created by Colin Priest for an in-class competition organized by the Actuaries Institute of Australia, Institute and Faculty of Actuaries and the Singapore Actuarial Society.
Using the data can you build a predictive model and validate it?
CREATE TABLE data (
"claimnumber" VARCHAR,
"datetimeofaccident" VARCHAR,
"datereported" VARCHAR,
"age" BIGINT,
"gender" VARCHAR,
"maritalstatus" VARCHAR,
"dependentchildren" BIGINT,
"dependentsother" BIGINT,
"weeklywages" DOUBLE,
"parttimefulltime" VARCHAR,
"hoursworkedperweek" DOUBLE,
"daysworkedperweek" BIGINT,
"claimdescriptionkeyword_0" DOUBLE,
"claimdescriptionkeyword_1" DOUBLE,
"claimdescriptionkeyword_2" DOUBLE,
"claimdescriptionkeyword_3" DOUBLE,
"claimdescriptionkeyword_4" DOUBLE,
"claimdescriptionkeyword_5" DOUBLE,
"claimdescriptionkeyword_6" DOUBLE,
"claimdescriptionkeyword_7" DOUBLE,
"claimdescriptionkeyword_8" DOUBLE,
"claimdescriptionkeyword_9" DOUBLE,
"claimdescriptionkeyword_10" DOUBLE,
"claimdescriptionkeyword_11" DOUBLE,
"initialincurredclaimscost" BIGINT,
"ultimateincurredclaimcost" DOUBLE
);Anyone who has the link will be able to view this.