[ 
https://issues.apache.org/jira/browse/SPARK-8587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600568#comment-14600568
 ] 

Rakesh Chalasani commented on SPARK-8587:
-----------------------------------------

Sure, I can add this on the KMeans pipelines, whenever thats get added ( I will 
watch out for it).

On a slightly different topic that can help in our own development, since we 
are more inclined here to add these features to ML Pipelines over MLlib, 
eventually will MLlib won't be supported and future development going to happen 
more on Pipeline API alone? Thanks.



> Return cost and cluster index KMeansModel.predict
> -------------------------------------------------
>
>                 Key: SPARK-8587
>                 URL: https://issues.apache.org/jira/browse/SPARK-8587
>             Project: Spark
>          Issue Type: Improvement
>          Components: MLlib
>            Reporter: Sam Stoelinga
>            Priority: Minor
>
> Looking at PySpark the implementation of KMeansModel.predict 
> https://github.com/apache/spark/blob/master/python/pyspark/mllib/clustering.py#L102
>  : 
> Currently:
> it calculates the cost of the closest cluster and returns the index only.
> My expectation:
> Easy way to let the same function or a new function to return the cost with 
> the index.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to