Predicting Bangkok Housing Prices

Predicting condo prices using web-scraped data

@kaggle.thedevastator_predicting_bangkok_condominium_prices_using_web

About this Dataset

Bangkok is one of the most popular tourist destinations in Southeast Asia, and its condominium market is booming. With so many options to choose from, how can you predict which condominium will be the best investment?

This dataset was collected from hipflat.com, one of the largest condominium listing websites in Thailand, to support predicting condominium prices in Bangkok. The data includes location, year built, project area, number of buildings, number of units, and price per square meter.

How to use the dataset

This dataset can be used to predict the prices of condominiums in Bangkok, Thailand. The data includes the location, year built, project area, number of buildings, number of floors, number of units, price per square meter, and distance to various amenities.
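As a starting point for price prediction, here is a minimal sketch of a one-feature baseline: an ordinary least-squares line that predicts price_sqm from bld_age. The rows are made-up toy values standing in for df_cleaned_for_ML_regression.csv, and the helper names (load_rows, fit_line, predict_price_sqm) are hypothetical. A real model would likely use many more columns.

```python
import csv
import io

# Toy rows standing in for df_cleaned_for_ML_regression.csv (values are made up).
SAMPLE_CSV = """district,bld_age,price_sqm
Watthana,0,100000
Watthana,5,95000
Bang Na,10,90000
Bang Na,20,80000
"""


def load_rows(handle):
    """Read CSV rows and convert the two numeric columns used here to float."""
    rows = []
    for record in csv.DictReader(handle):
        rows.append(
            {
                "district": record["district"],
                "bld_age": float(record["bld_age"]),
                "price_sqm": float(record["price_sqm"]),
            }
        )
    return rows


def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


rows = load_rows(io.StringIO(SAMPLE_CSV))
slope, intercept = fit_line(
    [r["bld_age"] for r in rows], [r["price_sqm"] for r in rows]
)


def predict_price_sqm(bld_age):
    """Predict price per square meter from building age with the fitted line."""
    return intercept + slope * bld_age
```

With the real file, the same functions could be fed from `open("df_cleaned_for_ML_regression.csv", newline="")` instead of the in-memory string, assuming the file is available locally under that name.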

Research Ideas

  • Predictive pricing for condominiums in Bangkok
  • Determining the most important features for predicting condominium prices
  • Identifying trends in the Bangkok condominium market
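For the feature-importance idea, one crude first pass is to rank numeric columns by the absolute Pearson correlation with price_sqm. This is a stand-in, not the method behind GBoost_best_feature_importances_.csv, which the source does not describe. The values below are made up, and pearson and rank_by_correlation are hypothetical helper names.

```python
import math

# Made-up toy columns keyed by names from df_cleaned_for_ML_regression.csv.
features = {
    "bld_age": [0.0, 5.0, 10.0, 20.0, 30.0],
    "nbr_floors": [40.0, 8.0, 30.0, 25.0, 7.0],
    "dist_tran_1": [0.2, 0.5, 1.0, 1.5, 3.0],
}
price_sqm = [150000.0, 120000.0, 110000.0, 90000.0, 60000.0]


def pearson(xs, ys):
    """Pearson correlation coefficient; assumes neither column is constant."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / math.sqrt(sxx * syy)


def rank_by_correlation(feature_columns, target):
    """Return (name, correlation) pairs, strongest absolute correlation first."""
    scores = {
        name: pearson(values, target) for name, values in feature_columns.items()
    }
    return sorted(scores.items(), key=lambda item: abs(item[1]), reverse=True)


ranking = rank_by_correlation(features, price_sqm)
for name, score in ranking:
    print(f"{name}: {score:+.3f}")
```

Correlation only captures linear, one-at-a-time relationships, so a tree-based model's importances may rank the columns quite differently.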

Acknowledgements

If you use this dataset in your research, please credit the original authors.

License

License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

Columns

File: GBoost_best_feature_importances_.csv

No column descriptions are listed for this file.

File: df_cleaned_for_ML_regression.csv

Column name Description
district The district of Bangkok where the condominium is located. (String)
latitude The latitude of the condominium. (Float)
longitude The longitude of the condominium. (Float)
year_built The year the condominium was built. (Integer)
proj_area The project area of the condominium. (Float)
nbr_buildings The number of buildings in the condominium. (Integer)
nbr_floors The number of floors in the condominium. (Integer)
units The number of units in the condominium. (Integer)
hospital The distance to the nearest hospital. (Float)
price_sqm The price per square meter. (Float)
bld_age The age of the condominium building. (Integer)
dist_shop_1 The distance to the nearest shop. (Float)
dist_shop_2 The distance to the nearest shop. (Float)
dist_shop_3 The distance to the nearest shop. (Float)
dist_shop_4 The distance to the nearest shop. (Float)
dist_shop_5 The distance to the nearest shop. (Float)
dist_school_1 The distance to the nearest school. (Float)
dist_school_2 The distance to the nearest school. (Float)
dist_school_3 The distance to the nearest school. (Float)
dist_school_4 The distance to the nearest school. (Float)
dist_school_5 The distance to the nearest school. (Float)
dist_food_1 The distance to the nearest food outlet. (Float)
dist_food_2 The distance to the nearest food outlet. (Float)
dist_food_3 The distance to the nearest food outlet. (Float)
dist_food_4 The distance to the nearest food outlet. (Float)
dist_food_5 The distance to the nearest food outlet. (Float)
tran_type2 The type of transportation. (String)
tran_type3 The type of transportation. (String)
tran_type4 The type of transportation. (String)
tran_type5 The type of transportation. (String)
tran_name1 The name of the transportation. (String)
tran_name2 The name of the transportation. (String)
tran_name3 The name of the transportation. (String)
tran_name4 The name of the transportation. (String)
tran_name5 The name of the transportation. (String)
dist_tran_1 The distance to the transportation. (Float)
dist_tran_2 The distance to the transportation. (Float)
dist_tran_3 The distance to the transportation. (Float)
dist_tran_4 The distance to the transportation. (Float)
dist_tran_5 The distance to the transportation. (Float)
Elevator The number of elevators in the condominium. (Integer)
Parking The number of parking spaces in the condominium. (Integer)
Security The number of security guards in the condominium. (Integer)
CCTV The number of CCTV cameras in the condominium. (Integer)
Pool The number of pools in the condominium. (Integer)
Sauna The number of saunas in the condominium. (Integer)
Gym The number of gyms in the condominium. (Integer)
Garden The number of gardens in the condominium. (Integer)
Playground The number of playgrounds in the condominium. (Integer)
Shop The number of shops in the condominium. (Integer)
Restaurant The number of restaurants in the condominium. (Integer)
Wifi The number of WiFi hotspots in the condominium. (Integer)
tran_type1 The type of transportation. (String)
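
The source does not say how the distance columns (hospital, dist_shop_1 to dist_shop_5, and so on) were computed or in which unit they are stored. One plausible approach, assuming they are great-circle distances derived from latitude and longitude, is the haversine formula. The sketch below uses hypothetical helper names (haversine_km, nearest_distances) and made-up coordinates. Reading the numeric suffix as a rank among the five nearest places is also an assumption.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometers between two points given in degrees."""
    phi1 = math.radians(lat1)
    phi2 = math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = (
        math.sin(dphi / 2) ** 2
        + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2
    )
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def nearest_distances(origin, places, k=5):
    """Return the k smallest distances from origin to places, in ascending order."""
    lat0, lon0 = origin
    distances = sorted(haversine_km(lat0, lon0, lat, lon) for lat, lon in places)
    return distances[:k]


# Made-up coordinates: one condominium and a few nearby shops.
condo = (13.75, 100.50)
shops = [(13.76, 100.50), (13.75, 100.52), (13.70, 100.55), (13.751, 100.501)]
top_three = nearest_distances(condo, shops, k=3)
```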

File: df_completed.csv

Column name Description
district The district of Bangkok where the condominium is located. (String)
latitude The latitude of the condominium. (Float)
longitude The longitude of the condominium. (Float)
year_built The year the condominium was built. (Integer)
proj_area The project area of the condominium. (Float)
nbr_buildings The number of buildings in the condominium. (Integer)
nbr_floors The number of floors in the condominium. (Integer)
units The number of units in the condominium. (Integer)
hospital The distance to the nearest hospital. (Float)
price_sqm The price per square meter. (Float)
shops The number of shops in the condominium. (Integer)
schools The number of schools in the condominium. (Integer)
restaurants The number of restaurants in the condominium. (Integer)
amenities The number of amenities in the condominium. (Integer)
transportation The number of transportation options in the condominium. (Integer)
change_last_q The change in price from the last quarter. (Float)
change_last_y The change in price from the last year. (Float)
rental_yield The rental yield of the condominium. (Float)
change_last_y_rental_price The change in rental price from the last year. (Float)
price_hist The price history of the condominium. (String)

File: df_0.csv

Column name Description
Elevator The number of elevators in the condominium. (Integer)
Garden The number of gardens in the condominium. (Integer)
Gym The number of gyms in the condominium. (Integer)
Parking The number of parking spaces in the condominium. (Integer)
Playground The number of playgrounds in the condominium. (Integer)
Pool The number of pools in the condominium. (Integer)
Restaurant The number of restaurants in the condominium. (Integer)
Sauna The number of saunas in the condominium. (Integer)
Security The number of security guards in the condominium. (Integer)
Shop The number of shops in the condominium. (Integer)
Wifi The number of WiFi hotspots in the condominium. (Integer)
district The district of Bangkok where the condominium is located. (String)
latitude The latitude of the condominium. (Float)
longitude The longitude of the condominium. (Float)
nbr_buildings The number of buildings in the condominium. (Integer)
nbr_floors The number of floors in the condominium. (Integer)
price_hist The price history of the condominium. (String)
price_sqm The price per square meter. (Float)
proj_area The project area of the condominium. (Float)
units The number of units in the condominium. (Integer)
year_built The year the condominium was built. (Integer)
CCTV The number of CCTV cameras in the condominium. (Integer)
date The date the data was collected. (Date)
name The name of the condominium. (String)
value The value of the condominium. (Float)

File: df_100.csv

Column name Description
CCTV The number of CCTV cameras in the condominium. (Integer)
Elevator The number of elevators in the condominium. (Integer)
Garden The number of gardens in the condominium. (Integer)
Gym The number of gyms in the condominium. (Integer)
Parking The number of parking spaces in the condominium. (Integer)
Playground The number of playgrounds in the condominium. (Integer)
Pool The number of pools in the condominium. (Integer)
Restaurant The number of restaurants in the condominium. (Integer)
Sauna The number of saunas in the condominium. (Integer)
Security The number of security guards in the condominium. (Integer)
Shop The number of shops in the condominium. (Integer)
Wifi The number of WiFi hotspots in the condominium. (Integer)
date The date the data was collected. (Date)
district The district of Bangkok where the condominium is located. (String)
latitude The latitude of the condominium. (Float)
longitude The longitude of the condominium. (Float)
name The name of the condominium. (String)
nbr_buildings The number of buildings in the condominium. (Integer)
nbr_floors The number of floors in the condominium. (Integer)
price_hist The price history of the condominium. (String)
price_sqm The price per square meter. (Float)
proj_area The project area of the condominium. (Float)
units The number of units in the condominium. (Integer)
value The value of the condominium. (Float)
year_built The year the condominium was built. (Integer)

File: df_1000.csv

Column name Description
CCTV The number of CCTV cameras in the condominium. (Integer)
Elevator The number of elevators in the condominium. (Integer)
Garden The number of gardens in the condominium. (Integer)
Gym The number of gyms in the condominium. (Integer)
Parking The number of parking spaces in the condominium. (Integer)
Playground The number of playgrounds in the condominium. (Integer)
Pool The number of pools in the condominium. (Integer)
Restaurant The number of restaurants in the condominium. (Integer)
Sauna The number of saunas in the condominium. (Integer)
Security The number of security guards in the condominium. (Integer)
Shop The number of shops in the condominium. (Integer)
Wifi The number of WiFi hotspots in the condominium. (Integer)
date The date the data was collected. (Date)
district The district of Bangkok where the condominium is located. (String)
latitude The latitude of the condominium. (Float)
longitude The longitude of the condominium. (Float)
name The name of the condominium. (String)
nbr_buildings The number of buildings in the condominium. (Integer)
nbr_floors The number of floors in the condominium. (Integer)
price_hist The price history of the condominium. (String)
price_sqm The price per square meter. (Float)
proj_area The project area of the condominium. (Float)
units The number of units in the condominium. (Integer)
value The value of the condominium. (Float)
year_built The year the condominium was built. (Integer)

File: df_1018.csv

Column name Description
CCTV The number of CCTV cameras in the condominium. (Integer)
Elevator The number of elevators in the condominium. (Integer)
Garden The number of gardens in the condominium. (Integer)
Gym The number of gyms in the condominium. (Integer)
Parking The number of parking spaces in the condominium. (Integer)
Playground The number of playgrounds in the condominium. (Integer)
Pool The number of pools in the condominium. (Integer)
Restaurant The number of restaurants in the condominium. (Integer)
Sauna The number of saunas in the condominium. (Integer)
Security The number of security guards in the condominium. (Integer)
Shop The number of shops in the condominium. (Integer)
Wifi The number of WiFi hotspots in the condominium. (Integer)
date The date the data was collected. (Date)
district The district of Bangkok where the condominium is located. (String)
latitude The latitude of the condominium. (Float)
longitude The longitude of the condominium. (Float)
name The name of the condominium. (String)
nbr_buildings The number of buildings in the condominium. (Integer)
nbr_floors The number of floors in the condominium. (Integer)
price_hist The price history of the condominium. (String)
price_sqm The price per square meter. (Float)
proj_area The project area of the condominium. (Float)
units The number of units in the condominium. (Integer)
value The value of the condominium. (Float)
year_built The year the condominium was built. (Integer)

File: df_200.csv

Column name Description
CCTV The number of CCTV cameras in the condominium. (Integer)
Elevator The number of elevators in the condominium. (Integer)
Garden The number of gardens in the condominium. (Integer)
Gym The number of gyms in the condominium. (Integer)
Parking The number of parking spaces in the condominium. (Integer)
Playground The number of playgrounds in the condominium. (Integer)
Pool The number of pools in the condominium. (Integer)
Restaurant The number of restaurants in the condominium. (Integer)
Sauna The number of saunas in the condominium. (Integer)
Security The number of security guards in the condominium. (Integer)
Shop The number of shops in the condominium. (Integer)
Wifi The number of WiFi hotspots in the condominium. (Integer)
date The date the data was collected. (Date)
district The district of Bangkok where the condominium is located. (String)
latitude The latitude of the condominium. (Float)
longitude The longitude of the condominium. (Float)
name The name of the condominium. (String)
nbr_buildings The number of buildings in the condominium. (Integer)
nbr_floors The number of floors in the condominium. (Integer)
price_hist The price history of the condominium. (String)
price_sqm The price per square meter. (Float)
proj_area The project area of the condominium. (Float)
units The number of units in the condominium. (Integer)
value The value of the condominium. (Float)
year_built The year the condominium was built. (Integer)

File: df_300.csv

Column name Description
CCTV The number of CCTV cameras in the condominium. (Integer)
Elevator The number of elevators in the condominium. (Integer)
Garden The number of gardens in the condominium. (Integer)
Gym The number of gyms in the condominium. (Integer)
Parking The number of parking spaces in the condominium. (Integer)
Playground The number of playgrounds in the condominium. (Integer)
Pool The number of pools in the condominium. (Integer)
Restaurant The number of restaurants in the condominium. (Integer)
Sauna The number of saunas in the condominium. (Integer)
Security The number of security guards in the condominium. (Integer)
Shop The number of shops in the condominium. (Integer)
Wifi The number of WiFi hotspots in the condominium. (Integer)
date The date the data was collected. (Date)
district The district of Bangkok where the condominium is located. (String)
latitude The latitude of the condominium. (Float)
longitude The longitude of the condominium. (Float)
name The name of the condominium. (String)
nbr_buildings The number of buildings in the condominium. (Integer)
nbr_floors The number of floors in the condominium. (Integer)
price_hist The price history of the condominium. (String)
price_sqm The price per square meter. (Float)
proj_area The project area of the condominium. (Float)
units The number of units in the condominium. (Integer)
value The value of the condominium. (Float)
year_built The year the condominium was built. (Integer)
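
The df_0.csv through df_1018.csv files list the same columns, though df_0.csv orders them differently from the rest. If they are partial batches of one scrape (an assumption; the source does not say), they can be combined by column name rather than by position. A minimal sketch with made-up rows and hypothetical helper names (combine_parts, write_combined):

```python
import csv
import io

# Made-up stand-ins for two of the df_<N>.csv files; note the different column order.
PART_A = """name,price_sqm,district
Condo A,100000,Watthana
"""
PART_B = """district,name,price_sqm
Bang Na,Condo B,80000
"""


def combine_parts(handles):
    """Concatenate CSV parts by column name, regardless of column order."""
    rows = []
    fieldnames = []
    for handle in handles:
        reader = csv.DictReader(handle)
        # Keep the first-seen order of columns across all parts.
        for column in reader.fieldnames or []:
            if column not in fieldnames:
                fieldnames.append(column)
        rows.extend(reader)
    return fieldnames, rows


def write_combined(fieldnames, rows, handle):
    """Write the combined rows with one header; missing values become empty."""
    writer = csv.DictWriter(handle, fieldnames=fieldnames, restval="")
    writer.writeheader()
    writer.writerows(rows)


fieldnames, rows = combine_parts([io.StringIO(PART_A), io.StringIO(PART_B)])
output = io.StringIO()
write_combined(fieldnames, rows, output)
```

With the real files, each in-memory string would be replaced by a handle from `open(path, newline="")`.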
