This project was conducted in Jupyter Notebook with Python mlenv. This project included 4 parts. The first part was preparing the data. Null values were found and dropped. All values were converted to floats and certain columns were dropped. To create two dataframes.
In the second part of the project, the data dimensions were reduced by PCA. A new dataframe pcs_df was created with 3 compononents the data was reduced into by PCA. The third part included clustering the cryptocurrencies using K-Means. This included plotting an elbow curve to determine the best value of k.
Looking at the plot above it appears k should be 4. The K-Means was run with 4 clusters as shown below.
Then two dataframes were concatenated to set up for the fourth part of the project.
Part 4 included plotting a 3-D scatterplot, hvplot table, scaling the data, and a 2-D scatter plot. Images below.
Overall the scatterplot was concentrated in the corner with TotalCoinsMined(X) less than 0.2 and TotalCoinsSupply(Y) less than 0.4. There are two notable outliers. Otherwise data seems to be consistent.