Dear all, the random initialization works well, but the default initialization is k-means|| and has made me struggle. Also, I had heard people one year ago struggling with it too, and everybody would just skip it and use random, but I cannot keep it inside me!
I have posted a minimal example here <http://stackoverflow.com/questions/39260820/is-sparks-kmeans-unable-to-handle-bigdata> .. Please advice, George Samaras