Baselight

UCI ML Datasets

Boston Housing Dataset

@kaggle.heptapod_uci_ml_datasets

Loading...
Loading...

About this Dataset

UCI ML Datasets

Context

This is a Data Set from UCI Machine Learning Repository which concerns housing values in suburbs of Boston.

Content

Number of Instances: 506

Attribute Information:

  1. CRIM: per capita crime rate by town
  2. ZN: proportion of residential land zoned for lots over 25,000 sq.ft.
  3. INDUS: proportion of non-retail business acres per town
  4. CHAS: Charles River dummy variable (= 1 if tract bounds river; 0 otherwise)
  5. NOX: nitric oxides concentration (parts per 10 million)
  6. RM: average number of rooms per dwelling
  7. AGE: proportion of owner-occupied units built prior to 1940
  8. DIS: weighted distances to five Boston employment centres
  9. RAD: index of accessibility to radial highways
  10. TAX: full-value property-tax rate per $10,000
  11. PTRATIO: pupil-teacher ratio by town
  12. B: 1000(Bk - 0.63)^2 where Bk is the proportion of blacks by town
  13. LSTAT: % lower status of the population
  14. MEDV: Median value of owner-occupied homes in $1000's

Missing Attribute Values: None

Acknowledgements

Source: https://archive.ics.uci.edu/ml/datasets/Housing
Origin: This dataset was taken from the StatLib library which is maintained at Carnegie Mellon University.
Creator:
Harrison, D. and Rubinfeld, D.L.
'Hedonic prices and the demand for clean air', J. Environ. Economics & Management, vol.5, 81-102, 1978.

Tables

Hou All

@kaggle.heptapod_uci_ml_datasets.hou_all
  • 32.33 KB
  • 505 rows
  • 15 columns
Loading...

CREATE TABLE hou_all (
  "n_0_00632" DOUBLE,
  "n_18" DOUBLE,
  "n_2_31" DOUBLE,
  "n_0" BIGINT,
  "n_0_538" DOUBLE,
  "n_6_575" DOUBLE,
  "n_65_2" DOUBLE,
  "n_4_09" DOUBLE,
  "n_1" BIGINT,
  "n_296" BIGINT,
  "n_15_3" DOUBLE,
  "n_396_9" DOUBLE,
  "n_4_98" DOUBLE,
  "n_24" DOUBLE,
  "n_1_1" BIGINT
);

Share link

Anyone who has the link will be able to view this.