Simon.J created SPARK-20634:
-------------------------------

             Summary: result of MLlib KMeans cluster is not stabilize
                 Key: SPARK-20634
                 URL: https://issues.apache.org/jira/browse/SPARK-20634
             Project: Spark
          Issue Type: Bug
          Components: MLlib
    Affects Versions: 2.0.2
         Environment: Windows 10
spark 2.0.2 standalone
spyder 3.1.4
Anaconda 4.3.0
python 3.5.2
            Reporter: Simon.J
            Priority: Critical


1.Get a DataFrame through python with Cx_Oracle lib.
2.Start a local Spark Session.
3.Convert the dataset for Kmeansmodel train.
4.Train the KMeans model and predict the same data.just set K =3
5.Get the ClassifierFeature of the KMeans model'predict.
6.Get the count of every ClassifierFeature.
7.Loop 4-6 for 20 times.
8.Compare the result of every time.
9.Find the KMeans result dose not stabilize.
10.The same dataset and param for ML package'KMeans, its result is the same.




--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to