Predicting Bangkok Housing Prices
Predicting condo prices using web-scraped data
@kaggle.thedevastator_predicting_bangkok_condominium_prices_using_web
Bangkok is one of the most popular tourist destinations in Southeast Asia, and its condominium market is booming. With so many options to choose from, how can you predict which condominium will be the best investment?
This dataset was collected from hipflat.com, one of the largest condominium listing websites in Thailand, in order to predict the prices of condominiums in Bangkok. The data includes information on location, year built, project area, number of buildings, number of units, and price per square meter.
This dataset can be used to predict the prices of condominiums in Bangkok, Thailand. The data includes the location, year built, project area, number of buildings, number of floors, number of units, price per square meter, and distance to various amenities. Possible uses include:
- Predictive pricing for condominiums in Bangkok
- Determining the most important features for predicting condominium prices
- Identifying trends in the Bangkok condominium market
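
As a starting point for the predictive-pricing idea above, a very simple baseline is to predict each condominium's `price_sqm` as the median price of its `district`. The sketch below is an illustration under stated assumptions, not the method used by the dataset authors: it assumes the CSV has `district` and `price_sqm` columns as documented below, the function names are hypothetical, and the demo rows are made up rather than taken from the dataset. It uses only the Python standard library.

```python
import csv
import io
from collections import defaultdict
from statistics import median


def load_rows(csv_file):
    """Read (district, price_sqm) pairs, skipping unusable rows."""
    rows = []
    for row in csv.DictReader(csv_file):
        try:
            price = float(row["price_sqm"])
        except (KeyError, TypeError, ValueError):
            continue  # missing or non-numeric price
        district = (row.get("district") or "").strip()
        if district:
            rows.append((district, price))
    return rows


def fit_district_medians(rows):
    """Return (median price per district, overall median price)."""
    by_district = defaultdict(list)
    for district, price in rows:
        by_district[district].append(price)
    medians = {d: median(prices) for d, prices in by_district.items()}
    overall = median(price for _, price in rows)
    return medians, overall


def predict(district, medians, overall):
    """Predict price_sqm; unseen districts fall back to the overall median."""
    return medians.get(district, overall)


def mean_absolute_error(rows, medians, overall):
    """Average absolute gap between actual and predicted price_sqm."""
    errors = [abs(price - predict(d, medians, overall)) for d, price in rows]
    return sum(errors) / len(errors)


if __name__ == "__main__":
    # Made-up sample rows, NOT real listings. For real data, pass
    # open("df_cleaned_for_ML_regression.csv", newline="", encoding="utf-8").
    sample = io.StringIO(
        "district,price_sqm\n"
        "A,100000\nA,120000\nA,140000\nB,60000\nB,80000\n"
    )
    rows = load_rows(sample)
    medians, overall = fit_district_medians(rows)
    print(medians, overall, mean_absolute_error(rows, medians, overall))
```

Any learned model is only worth its complexity if it beats a baseline like this on held-out rows, so in practice the medians should be fitted on a training split and the error measured on a separate test split.
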
If you use this dataset in your research, please credit the original authors.
License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.
File: GBoost_best_feature_importances_.csv
No column descriptions are listed for this file.
File: df_cleaned_for_ML_regression.csv
Column name | Description |
---|---|
district | The district of Bangkok where the condominium is located. (String) |
latitude | The latitude of the condominium. (Float) |
longitude | The longitude of the condominium. (Float) |
year_built | The year the condominium was built. (Integer) |
proj_area | The project area of the condominium. (Float) |
nbr_buildings | The number of buildings in the condominium. (Integer) |
nbr_floors | The number of floors in the condominium. (Integer) |
units | The number of units in the condominium. (Integer) |
hospital | The distance to the nearest hospital. (Float) |
price_sqm | The price per square meter. (Float) |
bld_age | The age of the condominium building. (Integer) |
dist_shop_1 | The distance to a nearby shop (entry 1 of 5). (Float) |
dist_shop_2 | The distance to a nearby shop (entry 2 of 5). (Float) |
dist_shop_3 | The distance to a nearby shop (entry 3 of 5). (Float) |
dist_shop_4 | The distance to a nearby shop (entry 4 of 5). (Float) |
dist_shop_5 | The distance to a nearby shop (entry 5 of 5). (Float) |
dist_school_1 | The distance to a nearby school (entry 1 of 5). (Float) |
dist_school_2 | The distance to a nearby school (entry 2 of 5). (Float) |
dist_school_3 | The distance to a nearby school (entry 3 of 5). (Float) |
dist_school_4 | The distance to a nearby school (entry 4 of 5). (Float) |
dist_school_5 | The distance to a nearby school (entry 5 of 5). (Float) |
dist_food_1 | The distance to a nearby food outlet (entry 1 of 5). (Float) |
dist_food_2 | The distance to a nearby food outlet (entry 2 of 5). (Float) |
dist_food_3 | The distance to a nearby food outlet (entry 3 of 5). (Float) |
dist_food_4 | The distance to a nearby food outlet (entry 4 of 5). (Float) |
dist_food_5 | The distance to a nearby food outlet (entry 5 of 5). (Float) |
tran_type2 | The type of nearby transportation option 2 of 5. (String) |
tran_type3 | The type of nearby transportation option 3 of 5. (String) |
tran_type4 | The type of nearby transportation option 4 of 5. (String) |
tran_type5 | The type of nearby transportation option 5 of 5. (String) |
tran_name1 | The name of nearby transportation option 1 of 5. (String) |
tran_name2 | The name of nearby transportation option 2 of 5. (String) |
tran_name3 | The name of nearby transportation option 3 of 5. (String) |
tran_name4 | The name of nearby transportation option 4 of 5. (String) |
tran_name5 | The name of nearby transportation option 5 of 5. (String) |
dist_tran_1 | The distance to nearby transportation option 1 of 5. (Float) |
dist_tran_2 | The distance to nearby transportation option 2 of 5. (Float) |
dist_tran_3 | The distance to nearby transportation option 3 of 5. (Float) |
dist_tran_4 | The distance to nearby transportation option 4 of 5. (Float) |
dist_tran_5 | The distance to nearby transportation option 5 of 5. (Float) |
Elevator | The number of elevators in the condominium. (Integer) |
Parking | The number of parking spaces in the condominium. (Integer) |
Security | The number of security guards in the condominium. (Integer) |
CCTV | The number of CCTV cameras in the condominium. (Integer) |
Pool | The number of pools in the condominium. (Integer) |
Sauna | The number of saunas in the condominium. (Integer) |
Gym | The number of gyms in the condominium. (Integer) |
Garden | The number of gardens in the condominium. (Integer) |
Playground | The number of playgrounds in the condominium. (Integer) |
Shop | The number of shops in the condominium. (Integer) |
Restaurant | The number of restaurants in the condominium. (Integer) |
Wifi | The number of WiFi hotspots in the condominium. (Integer) |
tran_type1 | The type of nearby transportation option 1 of 5. (String) |
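
To get a first impression of which numeric columns in this file move together with `price_sqm`, one option is to rank them by the absolute value of their Pearson correlation with the target. This is a deliberately simple stand-in and not the gradient-boosting feature importances that the `GBoost_best_feature_importances_.csv` file name points to. Correlation only captures linear relationships. The function names are hypothetical and the demo values are made up.

```python
import csv
import io
import math


def pearson(xs, ys):
    """Pearson correlation; returns 0.0 when either series is constant."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def rank_features(rows, features, target="price_sqm"):
    """Rank features by |correlation| with the target, strongest first."""
    ranking = []
    for name in features:
        pairs = []
        for row in rows:
            try:
                pairs.append((float(row[name]), float(row[target])))
            except (KeyError, TypeError, ValueError):
                continue  # skip rows where either value is missing
        if len(pairs) >= 2:
            xs, ys = zip(*pairs)
            ranking.append((name, pearson(xs, ys)))
    return sorted(ranking, key=lambda item: abs(item[1]), reverse=True)


if __name__ == "__main__":
    # Made-up values, NOT real listings; for real data, read
    # df_cleaned_for_ML_regression.csv with csv.DictReader instead.
    sample = io.StringIO(
        "price_sqm,bld_age,units,hospital\n"
        "100,10,5,1\n200,8,7,1\n300,6,3,1\n400,4,8,1\n"
    )
    rows = list(csv.DictReader(sample))
    for name, r in rank_features(rows, ["bld_age", "units", "hospital"]):
        print(f"{name}: {r:+.3f}")
```

String columns such as `district` or `tran_type1` cannot be ranked this way without first being encoded as numbers, so only numeric column names should be passed in `features`.
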
File: df_completed.csv
Column name | Description |
---|---|
district | The district of Bangkok where the condominium is located. (String) |
latitude | The latitude of the condominium. (Float) |
longitude | The longitude of the condominium. (Float) |
year_built | The year the condominium was built. (Integer) |
proj_area | The project area of the condominium. (Float) |
nbr_buildings | The number of buildings in the condominium. (Integer) |
nbr_floors | The number of floors in the condominium. (Integer) |
units | The number of units in the condominium. (Integer) |
hospital | The distance to the nearest hospital. (Float) |
price_sqm | The price per square meter. (Float) |
shops | The number of shops in the condominium. (Integer) |
schools | The number of schools in the condominium. (Integer) |
restaurants | The number of restaurants in the condominium. (Integer) |
amenities | The number of amenities in the condominium. (Integer) |
transportation | The number of transportation options in the condominium. (Integer) |
change_last_q | The change in price from the last quarter. (Float) |
change_last_y | The change in price from the last year. (Float) |
rental_yield | The rental yield of the condominium. (Float) |
change_last_y_rental_price | The change in rental price from the last year. (Float) |
price_hist | The price history of the condominium. (String) |
File: df_0.csv
Column name | Description |
---|---|
Elevator | The number of elevators in the condominium. (Integer) |
Garden | The number of gardens in the condominium. (Integer) |
Gym | The number of gyms in the condominium. (Integer) |
Parking | The number of parking spaces in the condominium. (Integer) |
Playground | The number of playgrounds in the condominium. (Integer) |
Pool | The number of pools in the condominium. (Integer) |
Restaurant | The number of restaurants in the condominium. (Integer) |
Sauna | The number of saunas in the condominium. (Integer) |
Security | The number of security guards in the condominium. (Integer) |
Shop | The number of shops in the condominium. (Integer) |
Wifi | The number of WiFi hotspots in the condominium. (Integer) |
district | The district of Bangkok where the condominium is located. (String) |
latitude | The latitude of the condominium. (Float) |
longitude | The longitude of the condominium. (Float) |
nbr_buildings | The number of buildings in the condominium. (Integer) |
nbr_floors | The number of floors in the condominium. (Integer) |
price_hist | The price history of the condominium. (String) |
price_sqm | The price per square meter. (Float) |
proj_area | The project area of the condominium. (Float) |
units | The number of units in the condominium. (Integer) |
year_built | The year the condominium was built. (Integer) |
CCTV | The number of CCTV cameras in the condominium. (Integer) |
date | The date the data was collected. (Date) |
name | The name of the condominium. (String) |
value | The value of the condominium. (Float) |
File: df_100.csv
Column name | Description |
---|---|
CCTV | The number of CCTV cameras in the condominium. (Integer) |
Elevator | The number of elevators in the condominium. (Integer) |
Garden | The number of gardens in the condominium. (Integer) |
Gym | The number of gyms in the condominium. (Integer) |
Parking | The number of parking spaces in the condominium. (Integer) |
Playground | The number of playgrounds in the condominium. (Integer) |
Pool | The number of pools in the condominium. (Integer) |
Restaurant | The number of restaurants in the condominium. (Integer) |
Sauna | The number of saunas in the condominium. (Integer) |
Security | The number of security guards in the condominium. (Integer) |
Shop | The number of shops in the condominium. (Integer) |
Wifi | The number of WiFi hotspots in the condominium. (Integer) |
date | The date the data was collected. (Date) |
district | The district of Bangkok where the condominium is located. (String) |
latitude | The latitude of the condominium. (Float) |
longitude | The longitude of the condominium. (Float) |
name | The name of the condominium. (String) |
nbr_buildings | The number of buildings in the condominium. (Integer) |
nbr_floors | The number of floors in the condominium. (Integer) |
price_hist | The price history of the condominium. (String) |
price_sqm | The price per square meter. (Float) |
proj_area | The project area of the condominium. (Float) |
units | The number of units in the condominium. (Integer) |
value | The value of the condominium. (Float) |
year_built | The year the condominium was built. (Integer) |
File: df_1000.csv
Column name | Description |
---|---|
CCTV | The number of CCTV cameras in the condominium. (Integer) |
Elevator | The number of elevators in the condominium. (Integer) |
Garden | The number of gardens in the condominium. (Integer) |
Gym | The number of gyms in the condominium. (Integer) |
Parking | The number of parking spaces in the condominium. (Integer) |
Playground | The number of playgrounds in the condominium. (Integer) |
Pool | The number of pools in the condominium. (Integer) |
Restaurant | The number of restaurants in the condominium. (Integer) |
Sauna | The number of saunas in the condominium. (Integer) |
Security | The number of security guards in the condominium. (Integer) |
Shop | The number of shops in the condominium. (Integer) |
Wifi | The number of WiFi hotspots in the condominium. (Integer) |
date | The date the data was collected. (Date) |
district | The district of Bangkok where the condominium is located. (String) |
latitude | The latitude of the condominium. (Float) |
longitude | The longitude of the condominium. (Float) |
name | The name of the condominium. (String) |
nbr_buildings | The number of buildings in the condominium. (Integer) |
nbr_floors | The number of floors in the condominium. (Integer) |
price_hist | The price history of the condominium. (String) |
price_sqm | The price per square meter. (Float) |
proj_area | The project area of the condominium. (Float) |
units | The number of units in the condominium. (Integer) |
value | The value of the condominium. (Float) |
year_built | The year the condominium was built. (Integer) |
File: df_1018.csv
Column name | Description |
---|---|
CCTV | The number of CCTV cameras in the condominium. (Integer) |
Elevator | The number of elevators in the condominium. (Integer) |
Garden | The number of gardens in the condominium. (Integer) |
Gym | The number of gyms in the condominium. (Integer) |
Parking | The number of parking spaces in the condominium. (Integer) |
Playground | The number of playgrounds in the condominium. (Integer) |
Pool | The number of pools in the condominium. (Integer) |
Restaurant | The number of restaurants in the condominium. (Integer) |
Sauna | The number of saunas in the condominium. (Integer) |
Security | The number of security guards in the condominium. (Integer) |
Shop | The number of shops in the condominium. (Integer) |
Wifi | The number of WiFi hotspots in the condominium. (Integer) |
date | The date the data was collected. (Date) |
district | The district of Bangkok where the condominium is located. (String) |
latitude | The latitude of the condominium. (Float) |
longitude | The longitude of the condominium. (Float) |
name | The name of the condominium. (String) |
nbr_buildings | The number of buildings in the condominium. (Integer) |
nbr_floors | The number of floors in the condominium. (Integer) |
price_hist | The price history of the condominium. (String) |
price_sqm | The price per square meter. (Float) |
proj_area | The project area of the condominium. (Float) |
units | The number of units in the condominium. (Integer) |
value | The value of the condominium. (Float) |
year_built | The year the condominium was built. (Integer) |
File: df_200.csv
Column name | Description |
---|---|
CCTV | The number of CCTV cameras in the condominium. (Integer) |
Elevator | The number of elevators in the condominium. (Integer) |
Garden | The number of gardens in the condominium. (Integer) |
Gym | The number of gyms in the condominium. (Integer) |
Parking | The number of parking spaces in the condominium. (Integer) |
Playground | The number of playgrounds in the condominium. (Integer) |
Pool | The number of pools in the condominium. (Integer) |
Restaurant | The number of restaurants in the condominium. (Integer) |
Sauna | The number of saunas in the condominium. (Integer) |
Security | The number of security guards in the condominium. (Integer) |
Shop | The number of shops in the condominium. (Integer) |
Wifi | The number of WiFi hotspots in the condominium. (Integer) |
date | The date the data was collected. (Date) |
district | The district of Bangkok where the condominium is located. (String) |
latitude | The latitude of the condominium. (Float) |
longitude | The longitude of the condominium. (Float) |
name | The name of the condominium. (String) |
nbr_buildings | The number of buildings in the condominium. (Integer) |
nbr_floors | The number of floors in the condominium. (Integer) |
price_hist | The price history of the condominium. (String) |
price_sqm | The price per square meter. (Float) |
proj_area | The project area of the condominium. (Float) |
units | The number of units in the condominium. (Integer) |
value | The value of the condominium. (Float) |
year_built | The year the condominium was built. (Integer) |
File: df_300.csv
Column name | Description |
---|---|
CCTV | The number of CCTV cameras in the condominium. (Integer) |
Elevator | The number of elevators in the condominium. (Integer) |
Garden | The number of gardens in the condominium. (Integer) |
Gym | The number of gyms in the condominium. (Integer) |
Parking | The number of parking spaces in the condominium. (Integer) |
Playground | The number of playgrounds in the condominium. (Integer) |
Pool | The number of pools in the condominium. (Integer) |
Restaurant | The number of restaurants in the condominium. (Integer) |
Sauna | The number of saunas in the condominium. (Integer) |
Security | The number of security guards in the condominium. (Integer) |
Shop | The number of shops in the condominium. (Integer) |
Wifi | The number of WiFi hotspots in the condominium. (Integer) |
date | The date the data was collected. (Date) |
district | The district of Bangkok where the condominium is located. (String) |
latitude | The latitude of the condominium. (Float) |
longitude | The longitude of the condominium. (Float) |
name | The name of the condominium. (String) |
nbr_buildings | The number of buildings in the condominium. (Integer) |
nbr_floors | The number of floors in the condominium. (Integer) |
price_hist | The price history of the condominium. (String) |
price_sqm | The price per square meter. (Float) |
proj_area | The project area of the condominium. (Float) |
units | The number of units in the condominium. (Integer) |
value | The value of the condominium. (Float) |
year_built | The year the condominium was built. (Integer) |
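
The `df_0.csv`, `df_100.csv`, `df_200.csv`, `df_300.csv`, `df_1000.csv`, and `df_1018.csv` files are documented with the same set of columns (in a different order for `df_0.csv`), so they can be stacked into one collection of rows. The sketch below is one way to do that with the standard library. It assumes the files share identical headers, as the tables above suggest, and raises an error otherwise. The `source_file` key and the function name are hypothetical additions, and the demo writes tiny made-up files rather than reading the real ones.

```python
import csv
import tempfile
from pathlib import Path


def merge_csv_files(paths):
    """Concatenate rows from CSV files that share the same set of columns."""
    merged, expected = [], None
    for path in paths:
        with open(path, newline="", encoding="utf-8") as handle:
            reader = csv.DictReader(handle)
            columns = set(reader.fieldnames or [])
            if expected is None:
                expected = columns  # first file defines the schema
            elif columns != expected:
                diff = sorted(columns ^ expected)
                raise ValueError(f"{path}: columns differ: {diff}")
            for row in reader:
                # Hypothetical extra key recording where the row came from.
                row["source_file"] = Path(path).name
                merged.append(row)
    return merged


if __name__ == "__main__":
    # Made-up miniature files, NOT real data; with the real dataset, pass
    # the paths of the downloaded df_*.csv files instead.
    with tempfile.TemporaryDirectory() as tmp:
        first = Path(tmp) / "df_0.csv"
        second = Path(tmp) / "df_100.csv"
        first.write_text("name,price_sqm\nDemo One,1\n", encoding="utf-8")
        second.write_text("price_sqm,name\n2,Demo Two\n", encoding="utf-8")
        for row in merge_csv_files([first, second]):
            print(row)
```

Column order does not matter here because `csv.DictReader` keys each value by its header name. Whether the same condominium appears in more than one file is not stated on this page, so duplicates may need to be checked after merging.
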