Baselight

California Housing Prices Dataset

Housing data from the 1990 California census.

@kaggle.rehan497_california_housing_dataset

Loading...
Loading...

About this Dataset

California Housing Prices Dataset

📘 Context

This dataset contains housing data from the 1990 U.S. Census in California.
Each row represents a district with details like income, rooms, population, and location.
It helps in understanding how different factors affect house prices.


📂 Source

Originally from the StatLib repository, later included in the Scikit-Learn library as fetch_california_housing.
Published by Pace, R. Kelley and Barry, R. Barry (1997) — Sparse Spatial Autoregressions.


💡 Inspiration

  • Predict median house prices using demographic and geographic data.
  • Practice EDA, train-test split, and data scaling.
  • Build and evaluate regression models like Linear Regression or Random Forest.

Tables

Housing

@kaggle.rehan497_california_housing_dataset.housing
  • 415.11 kB
  • 20,640 rows
  • 10 columns
Loading...
CREATE TABLE housing (
  "longitude" DOUBLE,
  "latitude" DOUBLE,
  "housing_median_age" DOUBLE,
  "total_rooms" DOUBLE,
  "total_bedrooms" DOUBLE,
  "population" DOUBLE,
  "households" DOUBLE,
  "median_income" DOUBLE,
  "median_house_value" DOUBLE,
  "ocean_proximity" VARCHAR
);

Share link

Anyone who has the link will be able to view this.