[
https://issues.apache.org/jira/browse/SPARK-14594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15253229#comment-15253229
]
Felix Cheung commented on SPARK-14594:
--------------------------------------
Not sure if this was specific to the data types of "the_table", but yeah, it
works if I try:
{code}
> df <- createDataFrame(sqlContext, iris)
> rdd<-SparkR:::toRDD(df)
> gb<-SparkR:::groupByKey(rdd, 1000)
> first(gb)
[[1]]
[1] 4.3
[[2]]
[[2]][[1]]
[1] 3
{code}
Perhaps try:
{code}
t <- table(sqlContext, 'the_table')
printSchema(t)
{code}
and see what the schema looks like. Also, is "the_table" from the Hive context?
> Improve error messages for RDD API
> ----------------------------------
>
> Key: SPARK-14594
> URL: https://issues.apache.org/jira/browse/SPARK-14594
> Project: Spark
> Issue Type: Improvement
> Components: SparkR
> Affects Versions: 1.5.2
> Reporter: Marco Gaido
>
> When your R code using the RDD API has an error, you always get the same
> error message:
> Error in if (returnStatus != 0) { : argument is of length zero
> This is not very useful; I think it would be better to catch the R
> exception and show it instead.