[ https://issues.apache.org/jira/browse/SPARK-9325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14945446#comment-14945446 ]
Felix Cheung commented on SPARK-9325: ------------------------------------- My comment was on collect(df$Age). As I've stated, collect(select(df, df$Age)) would work. Russell, in fact, please take a look at subset support added recently http://spark.apache.org/docs/latest/api/R/index.html You would be able to do something like collect(df[df$age %in% c(19, 30), 1:2]) to get a R data.frame. > Support `collect` on DataFrame columns > -------------------------------------- > > Key: SPARK-9325 > URL: https://issues.apache.org/jira/browse/SPARK-9325 > Project: Spark > Issue Type: Sub-task > Components: SparkR > Reporter: Shivaram Venkataraman > > This is to support code of the form > ``` > ages <- collect(df$Age) > ``` > Right now `df$Age` returns a Column, which has no functions supported. > Similarly we might consider supporting `head(df$Age)` etc. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org