Baselight

Adult Census Income Data

Dataset contains train_data file, test_data file and description file

@kaggle.mikolajbabula_adult_census_income_data

Loading...
Loading...

About this Dataset

Adult Census Income Data

Below dataset contains three files:

adult_data.csv - train dataset
adult_test.csv - test dataset
adult_descr.csv - file with description of the data

In my kernel I start with looking what is presented in the dataset, what features are placed inside, what informations can be found and compared with eachothers, next I clean and prepare data into the form that is good for models I test. Starting with basic classification models, through hyperparameters tuning, ending on boosting algorithms I try to find best model, that is finally tested on the test dataset

Tables

Adult Data

@kaggle.mikolajbabula_adult_census_income_data.adult_data
  • 355.45 KB
  • 32560 rows
  • 15 columns
Loading...

CREATE TABLE adult_data (
  "n_39" BIGINT,
  "n__state_gov" VARCHAR,
  "n__77516" BIGINT,
  "n__bachelors" VARCHAR,
  "n__13" BIGINT,
  "n__never_married" VARCHAR,
  "n__adm_clerical" VARCHAR,
  "n__not_in_family" VARCHAR,
  "n__white" VARCHAR,
  "n__male" VARCHAR,
  "n__2174" BIGINT,
  "n__0" BIGINT,
  "n__40" BIGINT,
  "n__united_states" VARCHAR,
  "n__50k" VARCHAR
);

Adult Descr

@kaggle.mikolajbabula_adult_census_income_data.adult_descr
  • 6.95 KB
  • 106 rows
  • 2 columns
Loading...

CREATE TABLE adult_descr (
  "unnamed_0" VARCHAR,
  "n__this_was_extracted_from_the_census_bureau_database_found_at" VARCHAR
);

Adult Test

@kaggle.mikolajbabula_adult_census_income_data.adult_test
  • 508.03 KB
  • 16281 rows
  • 2 columns
Loading...

CREATE TABLE adult_test (
  "unnamed_0" VARCHAR,
  "n_1x3_cross_validator" VARCHAR
);

Share link

Anyone who has the link will be able to view this.