House Price Prediction Challenge
Overview
Welcome to the House Price Prediction Challenge, you will test your regression skills by designing an algorithm to accurately predict the house prices in India. Accurately predicting house prices can be a daunting task. The buyers are just not concerned about the size(square feet) of the house and there are various other factors that play a key role to decide the price of a house/property. It can be extremely difficult to figure out the right set of attributes that are contributing to understanding the buyer's behavior as such. This dataset has been collected across various property aggregators across India. In this competition, provided the 12 influencing factors your role as a data scientist is to predict the prices as accurately as possible.
Also, in this competition, you will get a lot of room for feature engineering and mastering advanced regression techniques such as Random Forest, Deep Neural Nets, and various other ensembling techniques.
Data Description:
Train.csv - 29451 rows x 12 columns
Test.csv - 68720 rows x 11 columns
Sample Submission - Acceptable submission format. (.csv/.xlsx file with 68720 rows)
Attributes Description:
Column |
Description |
POSTED_BY |
Category marking who has listed the property |
UNDER_CONSTRUCTION |
Under Construction or Not |
RERA |
Rera approved or Not |
BHK_NO |
Number of Rooms |
BHK_OR_RK |
Type of property |
SQUARE_FT |
Total area of the house in square feet |
READY_TO_MOVE |
Category marking Ready to move or Not |
RESALE |
Category marking Resale or not |
ADDRESS |
Address of the property |
LONGITUDE |
Longitude of the property |
LATITUDE |
Latitude of the property |
ACKNOWLEDGMENT:
The dataset for this hackathon was contributed by Devrup Banerjee . We would like to appreciate his efforts for this contribution to the Machinehack community.