Hey data engineers!
I’m using the K-means algorithm to perform clustering on a dataset. What is the most suitable visualization for a highly heterogeneous dataset, where there is a high concentration in a few agents and almost no concentration in many others?
I’m considering options such as scatter plot, boxplot, violin plot, UMAP. Which of these works best for this type of distribution?