GitHub user viirya opened a pull request:

    https://github.com/apache/spark/pull/1293

    [SPARK-2355] Add checker for the number of clusters

    When the number of clusters passed to 
org.apache.spark.mllib.clustering.KMeans in parallel initialization mode is 
greater than the number of data points, it throws 
ArrayIndexOutOfBoundsException.
    
    This PR adds a check on the number of clusters and throws 
IllegalArgumentException when that number is greater than the number of data 
points.
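
    For illustration only, the kind of check described above might look 
like the sketch below. It is not the actual patch: the object name 
ClusterCountCheck, the helper checkNumClusters, and the error message are 
hypothetical, and a plain Seq stands in for the input data so the example 
runs without Spark.

```scala
// Hypothetical sketch of the validation described in this PR; not the real patch.
object ClusterCountCheck {

  /** Rejects a cluster count `k` that exceeds the number of data points. */
  def checkNumClusters(k: Int, numPoints: Long): Unit = {
    // Scala's `require` throws IllegalArgumentException when the condition is false.
    require(k <= numPoints,
      s"Number of clusters ($k) must not exceed the number of data points ($numPoints).")
  }

  def main(args: Array[String]): Unit = {
    // A local collection stands in for the input data (assumption).
    val points = Seq(Array(1.0, 2.0), Array(3.0, 4.0))

    // k equal to the number of points is accepted.
    checkNumClusters(2, points.size.toLong)

    // k larger than the number of points is rejected up front,
    // instead of failing later with an out-of-bounds index.
    try {
      checkNumClusters(3, points.size.toLong)
    } catch {
      case e: IllegalArgumentException => println(s"rejected: ${e.getMessage}")
    }
  }
}
```

    Failing fast with IllegalArgumentException tells the caller that the 
argument is the problem, whereas ArrayIndexOutOfBoundsException only exposes 
an internal symptom.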
    


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/viirya/spark-1 check_clusters_number

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/1293.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #1293
    
----
commit 582cd11e5331a8e2704a5603080eec41c9002cf4
Author: Liang-Chi Hsieh <[email protected]>
Date:   2014-07-03T16:27:22Z

    simply add checker for the number of clusters.

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---