问题描述:

What are good model evaluation/performance metrics for KMeans Clustering algorithm. Spark MLLib has Within Set Sum of Squared Error (WSSSE) and we an also compute MeanWSSSE. However both these metrics are lower with increasing cluster size. How do you evaluate preferred cluster size for your model?

相关阅读:
Top