GitHub user chenghao-intel commented on the pull request:
https://github.com/apache/spark/pull/3362#issuecomment-63743881
Semantically, I don't think an outer join can always be done by broadcasting
the small table.
For example:
Say the right table is the small (broadcast) table and the join type is
`RightOuterJoin`. If a given partition of the left side does not contain the
`equikey`, but another partition does, that partition will still output the
tuple with the left-side references as null. It is supposed to output
nothing, since the other partition will output the correct tuple for that
key.
But a `LeftOuterJoin` with the right table broadcast probably works, since
each left row lives in exactly one partition and is emitted exactly once.
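
To make the concern concrete, here is a minimal sketch in plain Python (not
Spark code, and not the implementation in this PR). The function names and the
toy data are hypothetical. It assumes a naive strategy in which every left
partition joins independently against the broadcast right table, with no
cross-partition tracking of which right rows were already matched.

```python
def build_hash(rows):
    """Group (key, value) rows by key."""
    table = {}
    for key, value in rows:
        table.setdefault(key, []).append(value)
    return table


def broadcast_left_outer(left_partitions, right_rows):
    """Stream left partitions against a broadcast right table.

    Every left row is seen in exactly one partition, so each preserved
    (left) row is emitted exactly once.
    """
    right = build_hash(right_rows)
    out = []
    for part in left_partitions:
        for key, lval in part:
            matches = right.get(key)
            if matches:
                out.extend((key, lval, rval) for rval in matches)
            else:
                out.append((key, lval, None))
    return out


def naive_broadcast_right_outer(left_partitions, right_rows):
    """Hypothetical naive right outer join with the right table broadcast.

    Each partition decides on its own whether a right row is unmatched,
    so a right row matched in one partition is still null-padded by
    every other partition that lacks its key.
    """
    out = []
    for part in left_partitions:
        local = build_hash(part)
        for key, rval in right_rows:
            matches = local.get(key)
            if matches:
                out.extend((key, lval, rval) for lval in matches)
            else:
                out.append((key, None, rval))
    return out


# Toy data: key 1 lives only in the first left partition.
left_partitions = [[(1, "a")], [(2, "b")]]
right_rows = [(1, "x")]

wrong = naive_broadcast_right_outer(left_partitions, right_rows)
ok = broadcast_left_outer(left_partitions, right_rows)
```

In this sketch `wrong` contains both `(1, "a", "x")` and a spurious
`(1, None, "x")` from the second partition, while a correct right outer join
would return only `(1, "a", "x")`. The left outer variant yields
`(1, "a", "x")` and `(2, "b", None)`, which matches the expected result.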