[ https://issues.apache.org/jira/browse/SPARK-14147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15211373#comment-15211373 ]
Sun Rui commented on SPARK-14147: --------------------------------- I am not familiar with ML. my question is that is there any reason that data of vector type be collected back to R side and how will it be manipulated? Can ML be solely done on Scala side? and do you know any existing R data type that can map to the vector type? > SparkR - ML predictors return features with vector datatype, however SparkR > doesn't support it > ---------------------------------------------------------------------------------------------- > > Key: SPARK-14147 > URL: https://issues.apache.org/jira/browse/SPARK-14147 > Project: Spark > Issue Type: Bug > Components: SparkR > Reporter: Narine Kokhlikyan > > It seems that ML predictors in SparkR return an output which contains > features represented with vector datatype, however SparkR doesn't support it > and as a result features are being displayed as an environment variable. > example: > prediction <- predict(model, training) > DataFrame[Sepal_Length:double, Sepal_Width:double, Petal_Length:double, > Petal_Width:double, features:vector, prediction:int] > collect(prediction) > Sepal_Length Sepal_Width Petal_Length Petal_Width > features prediction > 1 5.1 3.5 1.4 0.2 <environment: > 0x10b7a8870> 1 > 2 4.9 3.0 1.4 0.2 <environment: > 0x10b79d498> 1 > 3 4.7 3.2 1.3 0.2 <environment: > 0x10b7960a8> 1 -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org