GitHub user NarineK opened a pull request:

    https://github.com/apache/spark/pull/13760

    [SPARK-16012][SparkR] GapplyCollect - applies a R function to each group 
similar to gapply and collects the result back to R data.frame

    ## What changes were proposed in this pull request?
    gapplyCollect() does gapply() on a SparkDataFrame and collect the result 
back to R. Compared to gapply() + collect(), gapplyCollect() offers performance 
optimization as well as programming convenience, as no schema is needed to be 
provided.
    
    This is similar to dapplyCollect().
    
    ## How was this patch tested?
    Added test cases for gapplyCollect similar to dapplyCollect


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/NarineK/spark gapplyCollect

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/13760.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #13760
    
----
commit ea31820c9501d1f8cba96bc7f8e0fab04e9af0a2
Author: Narine Kokhlikyan <[email protected]>
Date:   2016-06-17T10:51:15Z

    initial version of gapplyCollect

commit f8e54dc265ad0eb66a26508bc5221606a9652e22
Author: Narine Kokhlikyan <[email protected]>
Date:   2016-06-17T11:00:05Z

    merged with master

commit 591c4804764cdce67d22ce52ec38c74f246e738b
Author: Narine Kokhlikyan <[email protected]>
Date:   2016-06-17T11:03:04Z

    revert .gitignore

commit 37b633afdff46374d983d204f13c05c769c7f40e
Author: Narine Kokhlikyan <[email protected]>
Date:   2016-06-18T08:59:05Z

    added test cases + improved the code

commit de5dbb0be0a3fcc42096a10470c543eaf7aa6d5c
Author: Narine Kokhlikyan <[email protected]>
Date:   2016-06-18T09:05:57Z

    fixed test case

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to