[ 
https://issues.apache.org/jira/browse/SPARK-14147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15211384#comment-15211384
 ] 

Narine Kokhlikyan edited comment on SPARK-14147 at 3/25/16 3:51 AM:
--------------------------------------------------------------------

This happens when we call transform on PipelineModel. Scala datatype is being 
mapped to SparkR datatype.
dataFrame(callJMethod(object@model, "transform", newData@sdf)

Maybe we can map it to an array ?

[~yanboliang], do you think we can change the datatype mapping ?

This happens both to GLM and Kmeans


was (Author: narine):
This happens when we call transform on PipelineModel. Scala datatype is being 
mapped to SparkR datatype.
dataFrame(callJMethod(object@model, "transform", newData@sdf)

Maybe we can map it to an array ?

[~yanboliang], do you think we can change the datatype mapping ?

> SparkR - ML predictors return features with vector datatype, however SparkR 
> doesn't support it
> ----------------------------------------------------------------------------------------------
>
>                 Key: SPARK-14147
>                 URL: https://issues.apache.org/jira/browse/SPARK-14147
>             Project: Spark
>          Issue Type: Bug
>          Components: SparkR
>            Reporter: Narine Kokhlikyan
>
> It seems that ML predictors in SparkR return an output which contains 
> features represented by vector datatype, however SparkR doesn't support it 
> and as a result features are being displayed as an environment variable.
> example: 
> prediction <- predict(model, training)
> DataFrame[Sepal_Length:double, Sepal_Width:double, Petal_Length:double, 
> Petal_Width:double, features:vector, prediction:int]
> collect(prediction)
>     Sepal_Length Sepal_Width Petal_Length Petal_Width                   
> features prediction
> 1            5.1         3.5          1.4         0.2 <environment: 
> 0x10b7a8870>          1
> 2            4.9         3.0          1.4         0.2 <environment: 
> 0x10b79d498>          1
> 3            4.7         3.2          1.3         0.2 <environment: 
> 0x10b7960a8>          1



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to