
praveen76 / house-prices--advanced-regression-techniques


The House-Prices--Advanced-Regression-Techniques project tackles a regression problem: predicting house prices from 80 input features such as MSSubClass, LotFrontage, and LotArea. The target variable is SalePrice, and the model is built with XGBoost.

Home Page: https://towardsmachinelearning.org/

License: GNU General Public License v3.0

Jupyter Notebook 100.00%
house-price-prediction regression-algorithms regression-analysis regression-models xgboost-algorithm


House-Prices--Advanced-Regression-Techniques

Data: The dataset contains 80 input features and one target variable, SalePrice. The goal is to predict the sale price of each house from features such as MSSubClass, LotFrontage, and LotArea.

House Price Prediction

This is a regression problem. You can import the dataset from the following link to replicate the results and follow along with the experiment. We'll use XGBoost to solve this problem.

Instructions for Installation:

Dependencies: You'll need to install the dependencies below to run this project.

  • json: 2.0.9
  • pandas: 1.0.1
  • numpy: 1.18.1
  • matplotlib: 3.5.3
  • seaborn: 0.10.0
  • sklearn: 0.22.1
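
One way to compare an environment against the versions listed above is sketched below. It is a minimal check, not part of the original project: the function name check_versions is hypothetical, json is left out because it ships with Python's standard library, and sklearn is looked up under its distribution name, scikit-learn.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions copied from the dependency list in this README.
# "scikit-learn" is the distribution name of the package imported as sklearn.
EXPECTED = {
    "pandas": "1.0.1",
    "numpy": "1.18.1",
    "matplotlib": "3.5.3",
    "seaborn": "0.10.0",
    "scikit-learn": "0.22.1",
}


def check_versions(expected):
    """Return {package: (expected_version, installed_version_or_None)}."""
    report = {}
    for name, wanted in expected.items():
        try:
            installed = version(name)
        except PackageNotFoundError:
            installed = None  # package is not installed in this environment
        report[name] = (wanted, installed)
    return report


if __name__ == "__main__":
    for name, (wanted, installed) in check_versions(EXPECTED).items():
        status = "ok" if installed == wanted else "differs"
        print(f"{name}: expected {wanted}, installed {installed} ({status})")
```

A "differs" line does not necessarily mean the notebook will fail; it only flags where the environment departs from the tested versions.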

The code has been tested on Windows. It should work on other operating systems, but this has not yet been tested.

In case of any issues with installation or otherwise, please contact me on LinkedIn.

Important learnings:

  • Write a reusable function that determines the data type, null count, unique values, and null percentage (Null_Percent) of each variable and stores them in a dataframe.
  • Feature engineering.
  • An easy method to check null values across the features in the dataset.
  • Encode rare categories using RareLabelEncoder.
  • Create a class for temporal transformation that is compatible with the scikit-learn pipeline.
  • Build the preprocessing scikit-learn pipeline for steps such as missing value imputation, feature engineering, and data encoding.
  • Calculate feature importance.
  • Automatic selection of important features using SelectFromModel.
  • Compare different model versions: a model without preprocessed data, a model with processed data, and a model with important variables only.
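
The reusable summary function from the first learning could look roughly like the sketch below. The function name summarize_variables, the output column names other than Null_Percent, and the toy data are assumptions made for illustration; the notebook's own version may differ.

```python
import pandas as pd


def summarize_variables(df: pd.DataFrame) -> pd.DataFrame:
    """Return one row per column: data type, null count, unique values, null percent."""
    summary = pd.DataFrame({
        "Data_Type": df.dtypes.astype(str),
        "Null_Count": df.isnull().sum(),
        "Unique_Values": df.nunique(),  # NaN is not counted as a value
        "Null_Percent": (df.isnull().mean() * 100).round(2),
    })
    # Put the variables with the most missing data first.
    return summary.sort_values("Null_Percent", ascending=False)


# Tiny made-up frame that reuses three feature names from the dataset.
toy = pd.DataFrame({
    "LotFrontage": [65.0, None, 68.0, None],
    "LotArea": [8450, 9600, 11250, 9550],
    "MSSubClass": [60, 20, 60, 70],
})

print(summarize_variables(toy))
```

Because the result is itself a dataframe, it also gives a quick way to check null values across features, for example by filtering on Null_Count > 0.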
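
The temporal transformation class and the SelectFromModel step can be sketched together in a small pipeline. Everything here is illustrative rather than the project's actual code: the class name ElapsedYearsTransformer is hypothetical, the YrSold and YearBuilt columns and the toy values are assumed, "years elapsed before the sale" is only one plausible temporal transformation, and RandomForestRegressor stands in for XGBoost so the example depends only on scikit-learn and pandas.

```python
import pandas as pd
from sklearn.base import BaseEstimator, TransformerMixin
from sklearn.ensemble import RandomForestRegressor
from sklearn.feature_selection import SelectFromModel
from sklearn.pipeline import Pipeline


class ElapsedYearsTransformer(BaseEstimator, TransformerMixin):
    """Replace each year column with the years elapsed up to a reference column."""

    def __init__(self, variables, reference_variable):
        self.variables = variables
        self.reference_variable = reference_variable

    def fit(self, X, y=None):
        # Nothing is learned; fit exists so the class works inside a Pipeline.
        return self

    def transform(self, X):
        X = X.copy()  # avoid mutating the caller's dataframe
        for col in self.variables:
            X[col] = X[self.reference_variable] - X[col]
        return X


# Made-up rows for illustration only.
toy_X = pd.DataFrame({
    "YrSold": [2008, 2007, 2008, 2006, 2010, 2009],
    "YearBuilt": [2003, 1976, 2001, 1915, 2000, 1993],
    "LotArea": [8450, 9600, 11250, 9550, 14260, 14115],
})
toy_y = pd.Series([208500, 181500, 223500, 140000, 250000, 143000])

pipeline = Pipeline([
    ("elapsed", ElapsedYearsTransformer(["YearBuilt"], "YrSold")),
    # Keep the features whose importance is at least the median importance.
    ("select", SelectFromModel(
        RandomForestRegressor(n_estimators=20, random_state=0),
        threshold="median",
    )),
])

selected = pipeline.fit_transform(toy_X, toy_y)
print(selected.shape)
```

Inheriting from BaseEstimator and TransformerMixin is what makes the custom class usable as a pipeline step: it supplies get_params, set_params, and fit_transform, so only fit and transform need to be written by hand.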

Contributing

If you have a Data Science mini-project that you'd like to share, please follow the guidelines in CONTRIBUTING.md.

Code of Conduct

Please adhere to our Code of Conduct in all your interactions with the project.

License

This project is licensed under the MIT License.

Contact

For questions or inquiries, feel free to contact me on LinkedIn.

About Me:

I'm a seasoned Data Scientist and founder of TowardsMachineLearning.Org. I've worked on various Machine Learning, NLP, and cutting-edge deep learning frameworks to solve numerous business problems.
