Simon.J created SPARK-20634:
-------------------------------
Summary: result of MLlib KMeans cluster is not stabilize
Key: SPARK-20634
URL: https://issues.apache.org/jira/browse/SPARK-20634
Project: Spark
Issue Type: Bug
Components: MLlib
Affects Versions: 2.0.2
Environment: Windows 10
spark 2.0.2 standalone
spyder 3.1.4
Anaconda 4.3.0
python 3.5.2
Reporter: Simon.J
Priority: Critical
1.Get a DataFrame through python with Cx_Oracle lib.
2.Start a local Spark Session.
3.Convert the dataset for Kmeansmodel train.
4.Train the KMeans model and predict the same data.just set K =3
5.Get the ClassifierFeature of the KMeans model'predict.
6.Get the count of every ClassifierFeature.
7.Loop 4-6 for 20 times.
8.Compare the result of every time.
9.Find the KMeans result dose not stabilize.
10.The same dataset and param for ML package'KMeans, its result is the same.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]