GitHub user abehrens opened a pull request:

    https://github.com/apache/spark/pull/10710

    [pyspark] adding disjunction and difference functions for rdds

    I was looking for a way to perform disjunction and difference operations
    on RDDs, in other words:
    
    * disjunction: find all elements NOT in A and B
    * difference: find all elements in A but NOT in B
    
    If there are better names for these functions, I'd be happy to change
    them. I am a PySpark user, so they are only in the Python code. I would
    love to have them in Scala and Java too.
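
To make the two definitions above concrete, here is a minimal sketch on plain Python lists. It is not the code from this pull request. The function names are borrowed from the PR title, and reading "disjunction" as "elements in exactly one of A and B" (a symmetric difference) is an assumption about the author's intent.

```python
def difference(a, b):
    """Elements of a that do not appear in b (order and duplicates of a kept)."""
    b_set = set(b)
    return [x for x in a if x not in b_set]


def disjunction(a, b):
    """Elements found in exactly one of a and b.

    Assumption: this symmetric-difference reading is one interpretation of
    the PR description, not a confirmed definition.
    """
    return difference(a, b) + difference(b, a)


a = [1, 2, 3, 4]
b = [3, 4, 5]

print(difference(a, b))   # [1, 2]
print(disjunction(a, b))  # [1, 2, 5]
```

On actual RDDs, the existing `RDD.subtract` and `RDD.union` methods could likely express the same semantics, for example `a.subtract(b)` for the difference and `a.subtract(b).union(b.subtract(a))` for the symmetric-difference reading of disjunction.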

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/abehrens/spark disjunction

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/10710.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #10710
    
----
commit e4cc54b0d22b11d4c902c05d25403d87080ff960
Author: Alex Behrens <[email protected]>
Date:   2016-01-11T21:44:58Z

    [pyspark] adding disjunction and difference functions for rdds

----


