Context
This data was extracted from the 1994 Census Bureau database by Ronny Kohavi and Barry Becker (Data Mining and Visualization, Silicon Graphics). A set of reasonably clean records was extracted using the following conditions: ((AAGE>16) && (AGI>100) && (AFNLWGT>1) && (HRSWK>0)). The prediction task is to determine whether a person makes over $50K a year.
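The extraction conditions above can be read as a simple row filter. The following is a minimal sketch in plain Python, assuming each record is a dictionary keyed by the variable names used in the condition (AAGE, AGI, AFNLWGT, HRSWK). The sample records are invented purely to exercise the filter and are not taken from the dataset.

```python
def is_clean_record(record):
    """Return True if a record satisfies all four extraction conditions."""
    return (
        record["AAGE"] > 16
        and record["AGI"] > 100
        and record["AFNLWGT"] > 1
        and record["HRSWK"] > 0
    )


# Hypothetical records, made up for illustration only.
records = [
    {"AAGE": 39, "AGI": 2174, "AFNLWGT": 77516, "HRSWK": 40},  # passes all four
    {"AAGE": 15, "AGI": 500, "AFNLWGT": 1200, "HRSWK": 10},    # fails AAGE > 16
    {"AAGE": 45, "AGI": 50, "AFNLWGT": 1200, "HRSWK": 35},     # fails AGI > 100
    {"AAGE": 28, "AGI": 900, "AFNLWGT": 5000, "HRSWK": 0},     # fails HRSWK > 0
]

# Keep only the records that satisfy every condition.
clean = [r for r in records if is_clean_record(r)]
print(len(clean))
```

All four comparisons are strict, so a record with AAGE equal to 16 or HRSWK equal to 0 is dropped.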
To make the prediction task easier, we used artificial intelligence to transform the columns automatically.
Content
This dataset contains the original columns as well as new ones obtained by feeding the original US Census dataset to PredicSis.ai, which automatically:
- Discretise continuous variables into relevant intervals.
- Group values of categorical variables together in order to reduce the number of modalities (distinct values) of each variable.
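To make the two transformations concrete, here is a minimal sketch in plain Python. It is not the method PredicSis.ai uses, which is not described here. The cut points, the group definitions, and the helper names `discretise` and `group_value` are all hypothetical, chosen by hand only to show what a discretised or grouped column looks like.

```python
from bisect import bisect_right


def discretise(value, cut_points):
    """Map a number to an interval label, given sorted cut points."""
    i = bisect_right(cut_points, value)
    if i == 0:
        return f"(-inf, {cut_points[0]})"
    if i == len(cut_points):
        return f"[{cut_points[-1]}, +inf)"
    return f"[{cut_points[i - 1]}, {cut_points[i]})"


def group_value(value, groups, default="Other"):
    """Map a categorical value to the label of the group that contains it."""
    for label, members in groups.items():
        if value in members:
            return label
    return default


# Hypothetical cut points and groups, picked by hand for illustration.
AGE_CUTS = [25, 45, 65]
EDUCATION_GROUPS = {
    "Higher": {"Bachelors", "Masters", "Doctorate"},
    "Secondary": {"HS-grad", "11th", "12th"},
}

print(discretise(30, AGE_CUTS))
print(group_value("Masters", EDUCATION_GROUPS))
```

With these hand-picked settings, an age column with many distinct values collapses to four intervals, and an education column collapses to three labels including the fallback `Other`. An automated tool would choose the boundaries and groups from the data rather than take them as fixed inputs.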
Acknowledgements
https://archive.ics.uci.edu/ml/datasets/Census+Income
https://www.kaggle.com/uciml/adult-census-income
https://predicsis.ai
Inspiration
We want to see how much AutoML/AI improves the quality of a data scientist's work.