Is Spark's KMeans unable to handle bigdata?

Georgios Samaras Thu, 01 Sep 2016 15:36:06 -0700

Dear all,

  the random initialization works well, but the default initialization is
k-means|| and has made me struggle. Also, I had heard people one year ago
struggling with it too, and everybody would just skip it and use random,
but I cannot keep it inside me!


  I have posted a minimal example here
<http://stackoverflow.com/questions/39260820/is-sparks-kmeans-unable-to-handle-bigdata>
..

Please advice,
George Samaras

Is Spark's KMeans unable to handle bigdata?

Reply via email to