
manaralharbi / dsnd-term1-identify_customer_segments


Apply unsupervised learning techniques to identify customers segments.



Identify Customer Segments

Table of Contents

  1. Project Motivation
  2. File Descriptions
  3. Results
  4. Required Libraries
  5. Licensing, Authors, Acknowledgements

Project Motivation

In this project, I applied unsupervised learning techniques to demographic and spending data for a sample of German households. I preprocessed the data, applied dimensionality reduction, and used k-means clustering to segment customers, with the goal of optimizing customer outreach for a mail-order company.
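The steps described above (preprocessing, dimensionality reduction, k-means) can be sketched roughly as follows. This is a minimal illustration on synthetic data, not the notebook's actual code: the median imputation strategy, the number of principal components, and the number of clusters are all assumptions chosen for the example.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA
from sklearn.impute import SimpleImputer
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the demographic data.
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 10))
X[rng.random(X.shape) < 0.05] = np.nan  # inject some missing values

# Hypothetical settings; the notebook's actual choices may differ.
pipeline = make_pipeline(
    SimpleImputer(strategy="median"),                 # fill missing values
    StandardScaler(),                                 # zero mean, unit variance
    PCA(n_components=5, random_state=0),              # reduce dimensionality
    KMeans(n_clusters=4, n_init=10, random_state=0),  # segment the samples
)

# Fit every step and return one cluster label per row.
labels = pipeline.fit_predict(X)
print(np.bincount(labels))  # number of samples in each cluster
```

Chaining the steps in one pipeline means the same fitted imputer, scaler, and PCA can later be reused on a second dataset through `pipeline.predict`, so both datasets are assigned to the same clusters.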

File Descriptions

Identify_Customer_Segments.ipynb and Identify_Customer_Segments.html: a Jupyter notebook and an HTML file that contain all the project code for identifying customer segments.

Results

The graph below compares the proportion of data in each cluster for the customer data with that for the general population data.

[Figure: proportion of data in each cluster, customer data vs. general population data]

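As a result, cluster 8 is overrepresented in the customer data relative to the general population, so it is popular with the mail-order company, while cluster 4 is underrepresented and therefore unpopular with the company.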
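A comparison of this kind can be sketched as below. The label arrays are made-up values for illustration only, and `cluster_proportions` is a hypothetical helper, not a function from the notebook.

```python
import numpy as np


def cluster_proportions(labels, n_clusters):
    """Return the fraction of samples assigned to each cluster."""
    counts = np.bincount(labels, minlength=n_clusters)
    return counts / counts.sum()


# Hypothetical cluster assignments, not the project's real results.
general_labels = np.array([0, 0, 1, 1, 1, 2, 2, 2, 2, 2])
customer_labels = np.array([0, 2, 2, 2, 2, 2, 2, 2, 1, 2])

general_prop = cluster_proportions(general_labels, 3)
customer_prop = cluster_proportions(customer_labels, 3)

# A positive difference means the cluster is overrepresented among customers.
diff = customer_prop - general_prop
for k, d in enumerate(diff):
    print(f"cluster {k}: {d:+.2f}")
```

Comparing proportions rather than raw counts keeps the two datasets comparable even when they differ greatly in size.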

Most people of cluster 8 have the following characteristics:

  • males over 45 years old
  • very strong money-saver, investor, and inconspicuous financial profiles
  • conservative neighborhoods
  • average share of unemployment in the community

Most people of cluster 4 have the following characteristics:

  • females over 30 years old
  • very strong "low financial interest" and "be prepared" financial profiles
  • traditional-worker neighborhoods
  • high share of unemployment in the community
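One common way to arrive at cluster descriptions like these is to map a cluster centroid back to the original feature space by inverting the PCA and scaling steps, then reading off the feature values. Whether the notebook does exactly this is an assumption; the sketch below uses synthetic data and hypothetical feature names.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# Synthetic data with hypothetical feature names.
rng = np.random.default_rng(0)
feature_names = ["feature_a", "feature_b", "feature_c", "feature_d"]
X = rng.normal(loc=[1.0, 3.0, 5.0, 2.0], scale=1.0, size=(300, 4))

scaler = StandardScaler()
pca = PCA(n_components=2, random_state=0)
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0)

# Scale, project onto principal components, then cluster.
X_pca = pca.fit_transform(scaler.fit_transform(X))
kmeans.fit(X_pca)

# Undo PCA, then undo scaling, to express centroids in original units.
centroids = scaler.inverse_transform(
    pca.inverse_transform(kmeans.cluster_centers_)
)

# Inspect the first cluster's approximate feature values.
for name, value in zip(feature_names, centroids[0]):
    print(f"{name}: {value:.2f}")
```

Because PCA discards some variance, the recovered values are approximations of the typical cluster member, which is usually enough for a qualitative description.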

Required Libraries

  • Pandas, NumPy, Scikit-learn (Machine Learning Libraries)
  • Matplotlib and Seaborn (Data Visualization Libraries)
  • random (Python standard library)

Licensing, Authors, Acknowledgements

Credit goes to Udacity and Arvato Bertelsmann for providing the data and design for this project.
