Predict Property Valuations

General Assembly Data Science Final Project - Client Sellable


My Capstone project



Project Introduction


Company Overview:

Sellable takes the hassle out of selling your home, they’re property experts that buy homes, make them look spectacular, and then sell it using the best agents in the business. Currently receiving valuation requests of over 200 properties a month.

Objectives:

  • To have an automated valuation model to accurately predict property valuations.

Key Requirements:

  • Data clean customer .cvs file
  • Obtain satellite imagery and calculate square lot area and built area
  • Scrape valid information from Web Sites that would be used for the model

Action Plan


  • Data Gathering and Resources




High Level Overview of Process




Architecture Process Flow




Sellable Property Dashboard






Valuation Model


The property Prediction was based on various features listed previously

Regression Models Tested were


  • Lasso(L1) – Helps in reducing over fitting by been aggressive with feature selection, this is in order to enhance the prediction accuracy. It adds a penalty equivalent to the absolute value of coefficients

  • Rigde(L2) – Where predictor features are highly correlated, it adds a penalty equivalent to the square of the coefficients

  • RandomForest – An example of an ensemble method, meaning it relies on the aggregating the results of an ensemble of simpler estimators.



Model Selection Choice




Model Result Output




Image Recognition


Start with the address longitude/Latitude and feed this to google maps using an API key obtained

  • Grab the Google Image data and save the image files 400x400
  • Display the grayscale image
  • Mask based grayscale colour of the built up area
  • Label and show the connected components in the mask
  • Label the regions and find which label contains the centroid.
  • Calculate the Lot Size meters_ppx=156543.03392*Math.cos(data.latitude[0]



Application Data Entry Screen Demo




Predicted Valuation



Written on March 18, 2019