[jira] [Commented] (SPARK-9325) Support `collect` on DataFrame columns

Felix Cheung (JIRA) Tue, 06 Oct 2015 10:48:17 -0700

    [ 
https://issues.apache.org/jira/browse/SPARK-9325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14945446#comment-14945446
 ]


Felix Cheung commented on SPARK-9325:
-------------------------------------

My comment was on collect(df$Age). As I've stated, collect(select(df, df$Age)) 
would work.

Russell, in fact, please take a look at subset support added recently
http://spark.apache.org/docs/latest/api/R/index.html

You would be able to do something like collect(df[df$age %in% c(19, 30), 1:2]) 
to get a R data.frame.

> Support `collect` on DataFrame columns
> --------------------------------------
>
>                 Key: SPARK-9325
>                 URL: https://issues.apache.org/jira/browse/SPARK-9325
>             Project: Spark
>          Issue Type: Sub-task
>          Components: SparkR
>            Reporter: Shivaram Venkataraman
>
> This is to support code of the form 
> ```
> ages <- collect(df$Age)
> ```
> Right now `df$Age` returns a Column, which has no functions supported.
> Similarly we might consider supporting `head(df$Age)` etc.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (SPARK-9325) Support `collect` on DataFrame columns

Reply via email to