- Learn to upload a dataset to postgresql and query from this source.
- Work on files structure for projects
- Work on markdown skills :-)
- Practice classification models available in sckit-learn.
https://archive.ics.uci.edu/ml/datasets/Bank+Marketing
-
The dataset used in the analysis is queried from a postgresql database. The data are not big enough to require this, however it was good practice. Alternatively, source data can be found in the
/Data
folder -
Directions on the appropriate order to run the files are present in the
/Directions
folder text file.
- Logistic Regression
- K Nearest Neighbors
- Random Forrest Classifier
- The data are contained in csv format
/Data/bank-full.csv
- I loaded it into a postgre database for practice, feel free to just use the data from the file.
- Run
/scripts/connect.py
this will run the data load. - Open the Jupyter Notebook file
/Notebooks/Bank-Marketing-Project-EDA
to view the analysis performed on the data before the classification models were run. - Run
/scripts/data-transformation.py
to clean the data into usable format. - Run the various classification models in
/Models/classification models.py