GitHub user brennonyork opened a pull request:

    https://github.com/apache/spark/pull/5015

    [SPARK-4600][GraphX]: org.apache.spark.graphx.VertexRDD.diff does not work

    As it turns out, per the [discussion on the 
JIRA](https://issues.apache.org/jira/browse/SPARK-4600), `diff` is behaving 
exactly as it should. The confusion arose because I assumed it meant set 
difference, when in fact it does not. To that end, I merely updated the 
`diff` documentation to, hopefully, better reflect its intended behavior going 
forward.
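
To illustrate the distinction, here is a minimal sketch that models the two behaviors with plain Scala `Map`s instead of real `VertexRDD`s, so it runs without Spark. It assumes, following the commit messages in this pull request, that `diff` returns only the entries of the argument whose values differ from the calling collection for vertex ids present in both. The names `DiffSketch`, `diffLike`, and `setDifference` are hypothetical and are not part of GraphX.

```scala
object DiffSketch {
  // Hypothetical stand-in for the assumed `diff` behavior: keep entries of
  // `other` whose id also exists in `current` but whose value differs.
  def diffLike[V](current: Map[Long, V], other: Map[Long, V]): Map[Long, V] =
    other.filter { case (id, value) => current.get(id).exists(_ != value) }

  // What a true set difference on vertex ids would return instead:
  // entries of `current` whose id is absent from `other`.
  def setDifference[V](current: Map[Long, V], other: Map[Long, V]): Map[Long, V] =
    current.filter { case (id, _) => !other.contains(id) }

  def main(args: Array[String]): Unit = {
    val a = Map(1L -> "x", 2L -> "y", 3L -> "z")
    val b = Map(2L -> "y", 3L -> "changed", 4L -> "w")

    println(diffLike(a, b))      // Map(3 -> changed)
    println(setDifference(a, b)) // Map(1 -> x)
  }
}
```

In this model, vertex 1 (only in `a`) and vertex 4 (only in `b`) never appear in the `diffLike` result, which is why reading `diff` as a set difference leads to surprising output.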

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/brennonyork/spark SPARK-4600

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/5015.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #5015
    
----
commit aa57f832e074d0285fdcdda35ea10b21625fe8e0
Author: Brennon York <[email protected]>
Date:   2015-01-29T20:32:18Z

    updated to set ShortestPaths to run 'forward' rather than 'backward'

commit 710a21c187b0ac9251431e2f8f0576280c84014a
Author: Brennon York <[email protected]>
Date:   2015-02-09T18:47:26Z

    working diff given test case

commit 9717120e0a387ca9524c0134baf650eb31033d36
Author: Brennon York <[email protected]>
Date:   2015-02-13T00:08:22Z

    updated diff impl to cause fewer objects to be created

commit 74b8c95c82cf9d8caaf1e3765d35c24fea4d98ed
Author: Brennon York <[email protected]>
Date:   2015-02-20T20:12:32Z

    corrected  method by leveraging bitmask operations to correctly return only 
the portions of  that are different from the calling VertexRDD

commit 99ad41260a97671844a2d1ecb7f4175e80fa04f7
Author: Brennon York <[email protected]>
Date:   2015-02-25T22:20:41Z

    Merge remote-tracking branch 'upstream/master' into SPARK-4600

commit 66818b901243ee6c21cc885949c0de578be3c687
Author: Brennon York <[email protected]>
Date:   2015-02-25T23:04:27Z

    added small secondary diff test

commit cc16d65bf0cdd144bd7816be98a7b686e0782dc4
Author: Brennon York <[email protected]>
Date:   2015-03-13T16:07:25Z

    Merge remote-tracking branch 'upstream/master' into SPARK-4600

commit f42862387c56bb1eff7169865c75c1dab7d3dc2e
Author: Brennon York <[email protected]>
Date:   2015-03-13T16:09:49Z

    updated diff documentation to better represent its function

----


