Giter VIP home page Giter VIP logo

housing-price-predictions's Introduction

Housing-Price-Prediction-using-ML-algorithms

This Dataset has been taken from kaggle.com The source of the data set is given as follows: https://www.kaggle.com/c/house-prices-advanced-regression-techniques/data

  • train.csv - the training set
  • test.csv - the test set
  • In this data set, we need to predict which property has the best SalePrice using different models. We are trying to find which models give the best result.

Model Summary

We see here that:

  • In Bagging regressor, both Decision tree and Random Forest regressors over fits the data
  • In Pasting regressor, SVR with linear Kernel underfits the data while Decision Tree overfits the data.
  • In AdaBoost regressor, Decision Tree model over fits the data and SVR model underfits the data
  • In Gradient Boosting Regressor, the model overfits the data
  • Deep Learning model here gives us the best result with 90% train score and 86% test score.
  • Hence our best model so far which has properly fit the data has been Perceptron and MLP

    Analysis of RMSE: The RMSE score for the reduced dataset is somewhat similar than the score of the unreduced dataset(exception – Ridge and Lasso where the RMSE for the unreduced data is more than for the unreduced dataset)We have used 5 PC for our different models. There is not much difference in the scores for the reduced and unreduced data, maybe because we have taken fewer number of Principal Components. We see that our model over fits the data even after using PCA) The above results shows that irrespective of performing PCA on the models, almost all models overfits the data. The Ridge regressor model (without PCA) with 78% train score and 68% test score can be considered as the optimum model here as we can say it somewhat fits the data.

    Best Model: The best feature according to our Ridge regressor has been the LotArea followed by LotFrontage.

    Deep Learning: After performing Ensemble learning and Neural Networks, we see that even Ensemble Learning couldn’t remove the overfitting problem. However, we could see that the Deep Learning model gave 86% test score against 90% train score. Hence, this model had perfectly fit the data.

housing-price-predictions's People

Contributors

niadel91 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.