Github user tejasapatil commented on the issue:
https://github.com/apache/spark/pull/18692
@cloud-fan : In event when the (set of join keys) is a superset of (child
node's partitioning keys), its possible to avoid shuffle :
https://github.com/apache/spark/pull/19054 ... this can help with 2 cases -
when users unknowingly join over extra columns in addition to bucket columns
- the one you mentioned (ie. inferred conditions).
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]