Predict Property Valuations
General Assembly Data Science Final Project - Client Sellable
My Capstone project
Project Introduction
Company Overview:
Sellable takes the hassle out of selling your home, they’re property experts that buy homes, make them look spectacular, and then sell it using the best agents in the business. Currently receiving valuation requests of over 200 properties a month.
Objectives:
- To have an automated valuation model to accurately predict property valuations.
Key Requirements:
- Data clean customer .cvs file
- Obtain satellite imagery and calculate square lot area and built area
- Scrape valid information from Web Sites that would be used for the model
Action Plan
- Data Gathering and Resources
High Level Overview of Process
Architecture Process Flow
Sellable Property Dashboard
Valuation Model
The property Prediction was based on various features listed previously
Regression Models Tested were
-
Lasso(L1) – Helps in reducing over fitting by been aggressive with feature selection, this is in order to enhance the prediction accuracy. It adds a penalty equivalent to the absolute value of coefficients
-
Rigde(L2) – Where predictor features are highly correlated, it adds a penalty equivalent to the square of the coefficients
-
RandomForest – An example of an ensemble method, meaning it relies on the aggregating the results of an ensemble of simpler estimators.
Model Selection Choice
Model Result Output
Image Recognition
Start with the address longitude/Latitude and feed this to google maps using an API key obtained
- Grab the Google Image data and save the image files 400x400
- Display the grayscale image
- Mask based grayscale colour of the built up area
- Label and show the connected components in the mask
- Label the regions and find which label contains the centroid.
- Calculate the Lot Size meters_ppx=156543.03392*Math.cos(data.latitude[0]
Application Data Entry Screen Demo
Predicted Valuation