[GitHub] spark issue #21083: [SPARK-21479][SPARK-23564][SQL] infer additional filters...

maryannxue Tue, 17 Apr 2018 12:46:39 -0700

Github user maryannxue commented on the issue:

    https://github.com/apache/spark/pull/21083
  
    Thank you for you reply, @cloud-fan! I was not clear when you had become 
aware of the effort on SPARK-21479 so it might be a misunderstanding on my side 
and I apologize. Anyway, if you had had a closer look at the PR, you would have 
probably got the idea that it's basically the same approach as what you have 
here, only that you have covered more join types.
    Here's another note. There's two types of constraint-to-filter inference 
for joins going on here:
    1. Infer from the Join node constraints, which is covered by the 
`PushPredicateThroughJoin` rule;
    2. Infer from the sibling child node combined with the join condition, 
which is what you've added here.
    That said, the InnerLike joins should already be covered by 1 and might not 
be worth being considered again in this optimization rule. Not sure about 
LeftSemi joins, so it would be nice if we could have a test case that proves 
this optimization does something that has not yet been covered by 
`PushPredicateThroughJoin` rule.



---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark issue #21083: [SPARK-21479][SPARK-23564][SQL] infer additional filters...

Reply via email to