In this project, an MLP model is applied to solve a classification problem.
The dataset is from https://www.kaggle.com/carlolepelaars/toy-dataset.
A fictional dataset for exploratory data analysis (EDA) and to test simple prediction models. This toy dataset features 150000 rows and 6 columns.
- Note: All data is fictional. The data has been generated so that their distributions are convenient for statistical analysis.
- Number: A simple index number for each row
- City: The location of a person (Dallas, New York City, Los Angeles, Mountain View, Boston, Washington D.C., San Diego and Austin)
- Gender: Gender of a person (Male or Female)
- Age: The age of a person (Ranging from 25 to 65 years)
- Income: Annual income of a person (Ranging from -674 to 177175)
- Illness: Is the person Ill? (Yes or No)
- Pandas
- Scikit
python pipeline.py