Github user xuanyuanking commented on the issue:

    https://github.com/apache/spark/pull/22326
  
    ```
    move this rule to optimizer, as the last batch (but before the 
UpdateAttributeReferences batch). Since we apply this rule after filter 
pushdown, we can simply pull out any python udf in join condition. Also add 
this rule to Optimizer.nonExcludableRules, since this is a special optimizer 
rule that can't be turned off.
    ```
    Make sense, implement like this can also avoid breaking the assumption in 
Dataset.join of Join plan only return Join after analysis. I'll reimplement as 
this proposal soon. 


---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to