Giter VIP home page Giter VIP logo

vinodkumar-yerraballi / salary_prediction_project Goto Github PK

View Code? Open in Web Editor NEW
7.0 1.0 2.0 208.46 MB

Home Page: https://salarypredictionproject-2g8cme26awavyozazmas89.streamlit.app/

Python 89.17% Shell 0.01% Batchfile 0.01% HTML 0.06% CSS 0.02% PowerShell 0.01% Perl 0.02% C 3.00% C++ 4.05% DTrace 0.01% Tcl 1.85% Makefile 0.05% F# 0.01% C# 0.01% GSC 0.11% GLSL 0.01% Dockerfile 0.01% Roff 0.01% JavaScript 0.18% Jupyter Notebook 1.46%
classification jupyter-notebook machine-learning python salary-prediction

salary_prediction_project's Introduction

Salary_Prediction_Project

๐Ÿ“Œ About the Project

This is an end-to-end machine learning project aimed at creating a web app for predicting an individual's salary.

๐Ÿ“Œ Data Preprocessing and Exploratory Data Analysis ๐Ÿ“ˆ ๐Ÿ“Š ๐Ÿ“‰

Every project begins with data collection. We obtained the dataset from the Kaggle site and installed all the required libraries for this project. We performed data preprocessing steps, such as checking for null and duplicate values.

The data didn't contain any null values but had duplicate entries, which we removed. Additionally, we dealt with unwanted values in certain columns, replacing them with "others" as appropriate. We then embarked on Exploratory Data Analysis (EDA) to gain insights from the data. We explored exciting questions, like the distribution of working hours per week, the most demanded degree and education, and the job with the highest working hours. Furthermore, we investigated the average age for each gender. We utilized pie, bar, box, and histogram charts to visualize these insights.

๐Ÿ“Œ Machine Learning Process

We installed the required libraries and converted categorical values into numerical ones using label encoding. Next, we divided the data into dependent and independent variables. After that, we normalized the data using the StandardScaler. The dataset was then split into training and testing sets, with 20% of the data reserved for training. We created a function for the machine learning model and applied various classification algorithms, including logistic regression, Random Forest, and XGBoost. Among these, the Random Forest algorithm achieved an accuracy score of 87%. We saved the model using the Pickle library.

๐Ÿ“Œ Machine Learning Model Deployment

For model deployment, we used Streamlit, a user-friendly web app framework. In the app.py file, we loaded the Pickle files and utilized Streamlit's built-in functions like select boxes and number inputs to take user inputs. We normalized the data as required, used the Pickle model to make predictions, and displayed the results on the Streamlit app. Finally, we deployed the web app on the Streamlit Cloud platform, making it accessible as a website.

Web App: https://salarypredictionproject-2g8cme26awavyozazmas89.streamlit.app/

Watch the Video

salary_prediction_project's People

Contributors

vinodkumar-yerraballi avatar

Stargazers

 avatar  avatar Satyam Singh avatar Davi Feliz avatar  avatar Lieutenant  avatar Tiago Webber avatar

Watchers

 avatar

Forkers

twebberbr fersuax

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.