You are a Senior Manager at the Advisory Services team on a Big Four firm. One of your most important clients, a prominent investment bank, is interested in offering a new cryptocurrencies investment portfolio for its customers, however, they are lost in the immense universe of cryptocurrencies. They ask you to help them make sense of it all by generating a report of what cryptocurrencies are available on the trading market and how they can be grouped using classification.
In this homework assignment, you will put your new unsupervivsed learning and Amazon SageMaker skills into action by clustering cryptocurrencies and creating plots to present your results.
You are asked to accomplish the following main tasks:
-
Data Preprocessing: Prepare data for dimension reduction with PCA and clustering using K-Means.
-
Reducing Data Dimensions Using PCA: Reduce data dimension using the
PCA
algorithm fromsklearn
. -
Clustering Cryptocurrencies Using K-Means: Predict clusters using the cryptocurrencies data using the
KMeans
algorithm fromsklearn
. -
Visualizing Results: Create some plots and data tables to present your results.
-
Optional Challenge: Deploy your notebook to Amazon SageMaker.
Clustering Cryptocurrencies Using K-Means
- Elbow Curve
Elbow curve is used to calculate the WSS for different values of K, and select the best value of K when WSS starts to diminish. In this instance, the plot looks like a clear elbow at k = 4.
- 3D-Scatter with the PCA data and the clusters
- Scatter Plot with Tradable Cryptocurrencies