Wan Kun created SPARK-41940:
-------------------------------

             Summary: Infer IsNotNull constraints for complex expression join 
keys
                 Key: SPARK-41940
                 URL: https://issues.apache.org/jira/browse/SPARK-41940
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 3.4.0
            Reporter: Wan Kun


Infer IsNotNull constraints for complex join expressions could help filter a 
lot of rows before join.
For example, 
{code:sql}
CREATE TABLE t(id int, text string) using parquet;

SELECT *
FROM t as t1
LEFT JOIN t as t2
ON t1.text = get_json_object(t2.text, '$.not_exists_col');
{code}
we can infer *IsNotNull(get_json_object(t2.text, '$.not_exists_col'))* 
constraint alone with *IsNotNull(t2.text)* to filter a lot of rows for the 
right side of the join.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to