Github user cloud-fan commented on the issue:
https://github.com/apache/spark/pull/22326
IIUC, you are pulling out the join condition with python UDF and create a
filter above join. Then the join become a cross join, which usually runs very
slowly. I think we should keep the cross join check for this case.--- --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
