GitHub user adrian-wang opened a pull request:
https://github.com/apache/spark/pull/1049
Clean left semi join hash
Some improvements on top of PR #837: add another case to the whitelist and use
`filter` to build the result iterator.
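For readers unfamiliar with the pattern, the following is a minimal sketch of a hash-based left semi join that builds its result with `filter`. It is not the code from this pull request: the object name `LeftSemiJoinSketch`, the generic signature, and the sample data are all hypothetical, and it ignores details such as null join keys that the real operator has to handle.

```scala
// Illustrative only: generic stand-ins for Spark's row iterators.
object LeftSemiJoinSketch {

  /** Keeps each left row whose join key also appears on the right side. */
  def leftSemiJoin[L, R, K](left: Iterator[L], right: Iterator[R])(
      leftKey: L => K,
      rightKey: R => K): Iterator[L] = {
    // Build phase: collect the distinct join keys of the right side.
    val hashSet = right.map(rightKey).toSet
    // Probe phase: Iterator.filter is lazy, so left rows are streamed
    // and each one is emitted at most once, however many right rows match.
    left.filter(row => hashSet.contains(leftKey(row)))
  }

  def main(args: Array[String]): Unit = {
    val orders  = Iterator((1, "apple"), (2, "pear"), (3, "plum"))
    val paidIds = Iterator(3, 1, 1) // duplicate key on the right side
    val result  = leftSemiJoin(orders, paidIds)(row => row._1, id => id).toList
    println(result) // prints List((1,apple), (3,plum))
  }
}
```

Using `filter` rather than a hand-written iterator leaves the `hasNext`/`next` bookkeeping to the standard library, which appears to be the kind of cleanup the description refers to.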
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/adrian-wang/spark clean-LeftSemiJoinHash
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/1049.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #1049
----
commit 27579a9223885356b1bb3f726f8752115cc1131a
Author: Daoyuan <[email protected]>
Date: 2014-06-11T06:42:16Z
add semijoin to white list and use filter to create new iterator in
LeftSemiJoinBNL
commit b314d5ad42fab41fd68c5ba183e93165635be864
Author: Daoyuan <[email protected]>
Date: 2014-06-11T07:19:00Z
change hashSet name
----