[ https://issues.apache.org/jira/browse/SPARK-14594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15253229#comment-15253229 ]
Felix Cheung commented on SPARK-14594: -------------------------------------- Not sure if this was specific to the data types of "the_table", but yea, it works if I try` {code} > df <- createDataFrame(sqlContext, iris) > rdd<-SparkR:::toRDD(df) > gb<-SparkR:::groupByKey(rdd, 1000) > first(gb) [[1]] [1] 4.3 [[2]] [[2]][[1]] [1] 3 {code} perhaps try {code} t <- table(sqlContext, 'the_table') printSchema(t) {code} And see what it looks like? Also, is "the_table" from the hive context? > Improve error messages for RDD API > ---------------------------------- > > Key: SPARK-14594 > URL: https://issues.apache.org/jira/browse/SPARK-14594 > Project: Spark > Issue Type: Improvement > Components: SparkR > Affects Versions: 1.5.2 > Reporter: Marco Gaido > > When you have an error in your R code using the RDD API, you always get as > error message: > Error in if (returnStatus != 0) { : argument is of length zero > This is not very useful and I think it might be better to catch the R > exception and show it instead. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org