The aim is to build a model which predicts sales based on the money spent on different platforms such as TV, radio, and newspaper for marketing by using simple linear regression and multiple linear regression.
Data Source : Kaggle.com
Python Libraries used
- Pandas
- NumPy
- Matplotib
- Seaborn
- from sklearn.model_selection import train_test_split
- from sklearn import metrics
Pre-processing operations 1 Checking for missing values 2. Checking for duplicate values 3. Checking for outliers/extreme values
Exploratory Data Analysis
- Distribution of the target variable
- How sales is related to other independent variables
- Correlation between the variables
Model Building
Prediction using:
- Simple Linear Regression
- Multiple Linear Regression