[ 
https://issues.apache.org/jira/browse/SPARK-14594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15251507#comment-15251507
 ] 

Marco Gaido commented on SPARK-14594:
-------------------------------------

The code is quite simple, what I can't give you is the data since it's on the 
cluster of a client and I can't access it:


    df <- sql(sqlContext, "select id, data, time from the_table")
    rdd<-SparkR:::toRDD(df)
    gb<-SparkR:::groupByKey(rdd, 1000)
    ... //here you can do any action and it will crash with the above error

If you can generate by yourself some fake data I think it will be fine anyway...

> Improve error messages for RDD API
> ----------------------------------
>
>                 Key: SPARK-14594
>                 URL: https://issues.apache.org/jira/browse/SPARK-14594
>             Project: Spark
>          Issue Type: Improvement
>          Components: SparkR
>    Affects Versions: 1.5.2
>            Reporter: Marco Gaido
>
> When you have an error in your R code using the RDD API, you always get as 
> error message:
>     Error in if (returnStatus != 0) { : argument is of length zero
> This is not very useful and I think it might be better to catch the R 
> exception and show it instead.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to