[jira] [Updated] (SPARK-17836) Use cross validation to determine the number of clusters for EM or KMeans algorithms

2019-05-20 Thread Hyukjin Kwon (JIRA)


 [ 
https://issues.apache.org/jira/browse/SPARK-17836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hyukjin Kwon updated SPARK-17836:
-
Labels: bulk-closed  (was: )

> Use cross validation to determine the number of clusters for EM or KMeans
> algorithms
>
> Key: SPARK-17836
> URL: https://issues.apache.org/jira/browse/SPARK-17836
> Project: Spark
> Issue Type: New Feature
> Components: ML
> Reporter: Lei Wang
> Priority: Minor
> Labels: bulk-closed
>
> Sometimes it's not easy for users to determine the number of clusters.
> It would be very useful if Spark ML could support this.
> There are several methods for doing this, according to Wikipedia:
> https://en.wikipedia.org/wiki/Determining_the_number_of_clusters_in_a_data_set
> Weka uses cross validation.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Updated] (SPARK-17836) Use cross validation to determine the number of clusters for EM or KMeans algorithms

2016-10-08 Thread Lei Wang (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-17836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lei Wang updated SPARK-17836:
-
Description: 
Sometimes it's not easy for users to determine the number of clusters.
It would be very useful if Spark ML could support this.
There are several methods for doing this, according to Wikipedia:
https://en.wikipedia.org/wiki/Determining_the_number_of_clusters_in_a_data_set
Weka uses cross validation.

  was:
Sometimes it's not easy for users to determine the number of clusters.
It would be very useful if Spark ML could support this.
There are several methods for doing this, according to Wikipedia:
https://en.wikipedia.org/wiki/Determining_the_number_of_clusters_in_a_data_set
Weka uses crossing validation.


> Use cross validation to determine the number of clusters for EM or KMeans
> algorithms
>
> Key: SPARK-17836
> URL: https://issues.apache.org/jira/browse/SPARK-17836
> Project: Spark
> Issue Type: New Feature
> Components: ML
> Reporter: Lei Wang
> Priority: Minor
>
> Sometimes it's not easy for users to determine the number of clusters.
> It would be very useful if Spark ML could support this.
> There are several methods for doing this, according to Wikipedia:
> https://en.wikipedia.org/wiki/Determining_the_number_of_clusters_in_a_data_set
> Weka uses cross validation.






[jira] [Updated] (SPARK-17836) Use cross validation to determine the number of clusters for EM or KMeans algorithms

2016-10-08 Thread Sean Owen (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-17836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sean Owen updated SPARK-17836:
--
Priority: Minor  (was: Major)

> Use cross validation to determine the number of clusters for EM or KMeans
> algorithms
>
> Key: SPARK-17836
> URL: https://issues.apache.org/jira/browse/SPARK-17836
> Project: Spark
> Issue Type: New Feature
> Components: ML
> Reporter: Lei Wang
> Priority: Minor
>
> Sometimes it's not easy for users to determine the number of clusters.
> It would be very useful if Spark ML could support this.
> There are several methods for doing this, according to Wikipedia:
> https://en.wikipedia.org/wiki/Determining_the_number_of_clusters_in_a_data_set
> Weka uses crossing validation.






[jira] [Updated] (SPARK-17836) Use cross validation to determine the number of clusters for EM or KMeans algorithms

2016-10-08 Thread Lei Wang (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-17836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lei Wang updated SPARK-17836:
-
Issue Type: New Feature  (was: Bug)

> Use cross validation to determine the number of clusters for EM or KMeans
> algorithms
>
> Key: SPARK-17836
> URL: https://issues.apache.org/jira/browse/SPARK-17836
> Project: Spark
> Issue Type: New Feature
> Components: ML
> Reporter: Lei Wang
>
> Sometimes it's not easy for users to determine the number of clusters.
> It would be very useful if Spark ML could support this.
> There are several methods for doing this, according to Wikipedia:
> https://en.wikipedia.org/wiki/Determining_the_number_of_clusters_in_a_data_set
> Weka uses crossing validation.


